Wellington Kanyongo , Absalom E. Ezugwu , Tsitsi Moyo , Jean Vincent Fonou Dombeu
{"title":"Machine learning-based classification of medication adherence among patients with noncommunicable diseases","authors":"Wellington Kanyongo , Absalom E. Ezugwu , Tsitsi Moyo , Jean Vincent Fonou Dombeu","doi":"10.1016/j.imu.2024.101611","DOIUrl":null,"url":null,"abstract":"<div><div>Non-adherence to medication among individuals with non-communicable diseases (NCDs) leads to increased morbidity, mortality, and healthcare costs. The integration of electronic drug prescription and dispensation systems enables comprehensive analysis of medication adherence (MA). Patient-level and medical claims data for 8141 diabetic and hypertensive patients in Harare, Zimbabwe, were analysed. Non-adherence was defined as medication refills falling below 75 % of the intended 12 monthly claims, while adherence required at least 75 % of the refills. Classification employed multiple machine learning algorithms, including SVM, KNN, DT, Naïve Bayes, DNN, LR, and RF in Python 3.11.3. Significant variables for MA were identified through the Random Forest (RF) feature importance mechanism and the information gain technique. These included the annual quantity of medical supplies, annual claim amount, patient age, wellness program subscription, medical aid cover, contribution towards medical aid cover, comorbidity, diagnosis, hospital cover type, complications development, gender, and medical aid scheme. The total units of medical supplies dispensed annually emerged as the most significant predictor of MA. Considering the 8-feature subset, which consistently produced the most robust machine learning models, the classification accuracy of the ML classifiers ranged from 84.9 % to 88.2 %, while the AUC values varied between 0.857 and 0.934. RF, an ensemble learning technique, was the most robust classifier, achieving 88.2 % accuracy, an AUC of 0.935, and superior precision, recall, and F1-score. This model shows promise as a prognostic tool for enhancing MA, aiding in identifying adherence levels among patients. These findings contribute to addressing disparities in medication refilling and adherence rates among patients with NCDs. The ML model holds potential for the development of intelligent MA and intervention applications to improve patient adherence to medication in the chronic disease domain.</div></div>","PeriodicalId":13953,"journal":{"name":"Informatics in Medicine Unlocked","volume":"52 ","pages":"Article 101611"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatics in Medicine Unlocked","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352914824001680","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
摘要
非传染性疾病(NCDs)患者不坚持用药会导致发病率、死亡率和医疗成本增加。通过整合电子处方和配药系统,可以对用药依从性(MA)进行全面分析。我们分析了津巴布韦哈拉雷 8141 名糖尿病和高血压患者的患者层面数据和医疗报销数据。不坚持用药的定义是药物补给量低于 12 个月预定补给量的 75%,而坚持用药至少需要 75% 的补给量。在 Python 3.11.3 中,分类采用了多种机器学习算法,包括 SVM、KNN、DT、Naïve Bayes、DNN、LR 和 RF。通过随机森林(RF)特征重要性机制和信息增益技术,确定了 MA 的重要变量。这些变量包括年度医疗用品数量、年度索赔金额、患者年龄、健康计划订阅情况、医疗补助覆盖范围、对医疗补助覆盖范围的贡献、合并症、诊断、医院覆盖类型、并发症发展情况、性别和医疗补助计划。每年配发的医疗用品总量是预测医疗保险的最重要因素。考虑到 8 个特征子集始终产生最稳健的机器学习模型,ML 分类器的分类准确率介于 84.9 % 到 88.2 % 之间,AUC 值介于 0.857 到 0.934 之间。RF是一种集合学习技术,是最稳健的分类器,准确率达到88.2%,AUC值为0.935,精确度、召回率和F1-分数都很出色。该模型有望成为增强 MA 的预后工具,帮助识别患者的依从性水平。这些发现有助于解决非传染性疾病患者在药物补充和依从率方面的差异。该 ML 模型具有开发智能 MA 和干预应用的潜力,可改善慢性病患者的用药依从性。
Machine learning-based classification of medication adherence among patients with noncommunicable diseases
Non-adherence to medication among individuals with non-communicable diseases (NCDs) leads to increased morbidity, mortality, and healthcare costs. The integration of electronic drug prescription and dispensation systems enables comprehensive analysis of medication adherence (MA). Patient-level and medical claims data for 8141 diabetic and hypertensive patients in Harare, Zimbabwe, were analysed. Non-adherence was defined as medication refills falling below 75 % of the intended 12 monthly claims, while adherence required at least 75 % of the refills. Classification employed multiple machine learning algorithms, including SVM, KNN, DT, Naïve Bayes, DNN, LR, and RF in Python 3.11.3. Significant variables for MA were identified through the Random Forest (RF) feature importance mechanism and the information gain technique. These included the annual quantity of medical supplies, annual claim amount, patient age, wellness program subscription, medical aid cover, contribution towards medical aid cover, comorbidity, diagnosis, hospital cover type, complications development, gender, and medical aid scheme. The total units of medical supplies dispensed annually emerged as the most significant predictor of MA. Considering the 8-feature subset, which consistently produced the most robust machine learning models, the classification accuracy of the ML classifiers ranged from 84.9 % to 88.2 %, while the AUC values varied between 0.857 and 0.934. RF, an ensemble learning technique, was the most robust classifier, achieving 88.2 % accuracy, an AUC of 0.935, and superior precision, recall, and F1-score. This model shows promise as a prognostic tool for enhancing MA, aiding in identifying adherence levels among patients. These findings contribute to addressing disparities in medication refilling and adherence rates among patients with NCDs. The ML model holds potential for the development of intelligent MA and intervention applications to improve patient adherence to medication in the chronic disease domain.
期刊介绍:
Informatics in Medicine Unlocked (IMU) is an international gold open access journal covering a broad spectrum of topics within medical informatics, including (but not limited to) papers focusing on imaging, pathology, teledermatology, public health, ophthalmological, nursing and translational medicine informatics. The full papers that are published in the journal are accessible to all who visit the website.