Development of Machine Learning Algorithms for Identifying Patients With Limited Health Literacy

IF 2.1 4区医学 Q3 HEALTH CARE SCIENCES & SERVICES Journal of evaluation in clinical practice Pub Date : 2024-11-22 DOI:10.1111/jep.14248

Dylan Koole, Oscar Shen, Amanda Lans, Tom M. de Groot, J. J. Verlaan, J. H. Schwab

{"title":"Development of Machine Learning Algorithms for Identifying Patients With Limited Health Literacy","authors":"Dylan Koole, Oscar Shen, Amanda Lans, Tom M. de Groot, J. J. Verlaan, J. H. Schwab","doi":"10.1111/jep.14248","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Rationale</h3>\n \n <p>Limited health literacy (HL) leads to poor health outcomes, psychological stress, and misutilization of medical resources. Although interventions aimed at improving HL may be effective, identifying patients at risk of limited HL in the clinical workflow is challenging. With machine learning (ML) algorithms based on readily available data, healthcare professionals would be enabled to incorporate HL screening without the need for administering in-person HL screening tools.</p>\n </section>\n \n <section>\n \n <h3> Aims and Objectives</h3>\n \n <p>Develop ML algorithms to identify patients at risk for limited HL in spine patients.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>Between December 2021 and February 2023, consecutive English-speaking patients over the age of 18 and new to an urban academic outpatient spine clinic were approached for participation in a cross-sectional survey study. HL was assessed using the Newest Vital Sign and the scores were divided into limited (0–3) and adequate (4–6) HL. Additional patient characteristics were extracted through a sociodemographic survey and electronic health records. Subsequently, feature selection was performed by random forest algorithms with recursive feature selection and five ML models (stochastic gradient boosting, random forest, Bayes point machine, elastic-net penalized logistic regression, support vector machine) were developed to predict limited HL.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>Seven hundred and fifty-three patients were included for model development, of whom 259 (34.4%) had limited HL. Variables identified for predicting limited HL were age, Area Deprivation Index-national, Social Vulnerability Index, insurance category, Body Mass Index, race, college education, and employment status. The Elastic-Net Penalized Logistic Regression algorithm achieved the best performance with a c-statistic of 0.766, calibration slope/intercept of 1.044/−0.037, and Brier score of 0.179.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>Elastic-Net Penalized Logistic Regression had the best performance when compared with other ML algorithms with a c-statistic of 0.766, calibration slope/intercept of 1.044/−0.037, and a Brier score of 0.179. Over one-third of patients presenting to an outpatient spine center were found to have limited HL. While this algorithm is far from being used in clinical practice, ML algorithms offer a potential opportunity for identifying patients at risk for limited HL without administering in-person HL assessments. This could possibly enable screening and early intervention to mitigate the potential negative consequences of limited HL without taxing the existing clinical workflow.</p>\n </section>\n </div>","PeriodicalId":15997,"journal":{"name":"Journal of evaluation in clinical practice","volume":"31 1","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11582738/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of evaluation in clinical practice","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/jep.14248","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

Rationale

Limited health literacy (HL) leads to poor health outcomes, psychological stress, and misutilization of medical resources. Although interventions aimed at improving HL may be effective, identifying patients at risk of limited HL in the clinical workflow is challenging. With machine learning (ML) algorithms based on readily available data, healthcare professionals would be enabled to incorporate HL screening without the need for administering in-person HL screening tools.

Aims and Objectives

Develop ML algorithms to identify patients at risk for limited HL in spine patients.

Methods

Between December 2021 and February 2023, consecutive English-speaking patients over the age of 18 and new to an urban academic outpatient spine clinic were approached for participation in a cross-sectional survey study. HL was assessed using the Newest Vital Sign and the scores were divided into limited (0–3) and adequate (4–6) HL. Additional patient characteristics were extracted through a sociodemographic survey and electronic health records. Subsequently, feature selection was performed by random forest algorithms with recursive feature selection and five ML models (stochastic gradient boosting, random forest, Bayes point machine, elastic-net penalized logistic regression, support vector machine) were developed to predict limited HL.

Results

Seven hundred and fifty-three patients were included for model development, of whom 259 (34.4%) had limited HL. Variables identified for predicting limited HL were age, Area Deprivation Index-national, Social Vulnerability Index, insurance category, Body Mass Index, race, college education, and employment status. The Elastic-Net Penalized Logistic Regression algorithm achieved the best performance with a c-statistic of 0.766, calibration slope/intercept of 1.044/−0.037, and Brier score of 0.179.

Conclusion

Elastic-Net Penalized Logistic Regression had the best performance when compared with other ML algorithms with a c-statistic of 0.766, calibration slope/intercept of 1.044/−0.037, and a Brier score of 0.179. Over one-third of patients presenting to an outpatient spine center were found to have limited HL. While this algorithm is far from being used in clinical practice, ML algorithms offer a potential opportunity for identifying patients at risk for limited HL without administering in-person HL assessments. This could possibly enable screening and early intervention to mitigate the potential negative consequences of limited HL without taxing the existing clinical workflow.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

开发用于识别健康知识有限患者的机器学习算法。

理由有限的健康素养（HL）会导致不良的健康后果、心理压力和医疗资源的滥用。尽管旨在提高健康素养的干预措施可能有效，但在临床工作流程中识别有健康素养有限风险的患者仍具有挑战性。利用基于现成数据的机器学习（ML）算法，医疗保健专业人员将能够进行 HL 筛查，而无需亲自使用 HL 筛查工具：方法：在 2021 年 12 月至 2020 年 2 月期间，对脊柱患者进行 HL 筛查：在 2021 年 12 月至 2023 年 2 月期间，连续接触了 18 岁以上、刚到城市学术性脊柱门诊就诊的英语患者，让他们参与横断面调查研究。HL采用最新生命体征进行评估，得分分为有限HL（0-3分）和足够HL（4-6分）。通过社会人口学调查和电子健康记录提取了患者的其他特征。随后，采用随机森林算法和递归特征选择法进行特征选择，并开发了五种 ML 模型（随机梯度提升、随机森林、贝叶斯点机、弹性网惩罚性逻辑回归、支持向量机）来预测局限性 HL：共纳入 753 名患者进行模型开发，其中 259 人（34.4%）患有局限性 HL。用于预测局限性 HL 的变量包括年龄、全国地区贫困指数、社会脆弱性指数、保险类别、体重指数、种族、大学教育程度和就业状况。弹性网络惩罚逻辑回归算法的性能最佳，c 统计量为 0.766，校准斜率/截距为 1.044/-0.037，布赖尔评分为 0.179：弹性网惩罚逻辑回归与其他 ML 算法相比性能最佳，c 统计量为 0.766，校准斜率/截距为 1.044/-0.037，布赖尔评分为 0.179。在脊柱中心门诊就诊的患者中，超过三分之一的患者被发现患有局限性 HL。虽然这种算法远未应用于临床实践，但 ML 算法提供了一个潜在的机会，可以在不进行亲自 HL 评估的情况下识别有局限性 HL 风险的患者。这有可能实现筛查和早期干预，以减轻局限性 HL 的潜在负面影响，而不会对现有的临床工作流程造成负担。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of evaluation in clinical practice 医学-卫生保健

CiteScore

4.80

自引率

4.20%

发文量

143

审稿时长

3-8 weeks

期刊介绍： The Journal of Evaluation in Clinical Practice aims to promote the evaluation and development of clinical practice across medicine, nursing and the allied health professions. All aspects of health services research and public health policy analysis and debate are of interest to the Journal whether studied from a population-based or individual patient-centred perspective. Of particular interest to the Journal are submissions on all aspects of clinical effectiveness and efficiency including evidence-based medicine, clinical practice guidelines, clinical decision making, clinical services organisation, implementation and delivery, health economic evaluation, health process and outcome measurement and new or improved methods (conceptual and statistical) for systematic inquiry into clinical practice. Papers may take a classical quantitative or qualitative approach to investigation (or may utilise both techniques) or may take the form of learned essays, structured/systematic reviews and critiques.