Integrating Machine Learning With Microsimulation to Classify Hypothetical, Novel Patients for Predicting Pregabalin Treatment Response Based on Observational and Randomized Data in Patients With Painful Diabetic Peripheral Neuropathy

IF 2.3 Q2 MEDICINE, GENERAL & INTERNAL Pragmatic and Observational Research Pub Date : 2019-10-31 DOI:10.2147/POR.S214412

J. Alexander, R. Edwards, L. Manca, Roberto Grugni, Gianluca Bonfanti, B. Emir, E. Whalen, S. Watt, M. Brodsky, B. Parsons

{"title":"Integrating Machine Learning With Microsimulation to Classify Hypothetical, Novel Patients for Predicting Pregabalin Treatment Response Based on Observational and Randomized Data in Patients With Painful Diabetic Peripheral Neuropathy","authors":"J. Alexander, R. Edwards, L. Manca, Roberto Grugni, Gianluca Bonfanti, B. Emir, E. Whalen, S. Watt, M. Brodsky, B. Parsons","doi":"10.2147/POR.S214412","DOIUrl":null,"url":null,"abstract":"Purpose Variability in patient treatment responses can be a barrier to effective care. Utilization of available patient databases may improve the prediction of treatment responses. We evaluated machine learning methods to predict novel, individual patient responses to pregabalin for painful diabetic peripheral neuropathy, utilizing an agent-based modeling and simulation platform that integrates real-world observational study (OS) data and randomized clinical trial (RCT) data. Patients and methods The best supervised machine learning methods were selected (through literature review) and combined in a novel way for aligning patients with relevant subgroups that best enable prediction of pregabalin responses. Data were derived from a German OS of pregabalin (N=2642) and nine international RCTs (N=1320). Coarsened exact matching of OS and RCT patients was used and a hierarchical cluster analysis was implemented. We tested which machine learning methods would best align candidate patients with specific clusters that predict their pain scores over time. Cluster alignments would trigger assignments of cluster-specific time-series regressions with lagged variables as inputs in order to simulate “virtual” patients and generate 1000 trajectory variations for given novel patients. Results Instance-based machine learning methods (k-nearest neighbor, supervised fuzzy c-means) were selected for quantitative analyses. Each method alone correctly classified 56.7% and 39.1% of patients, respectively. An “ensemble method” (combining both methods) correctly classified 98.4% and 95.9% of patients in the training and testing datasets, respectively. Conclusion An ensemble combination of two instance-based machine learning techniques best accommodated different data types (dichotomous, categorical, continuous) and performed better than either technique alone in assigning novel patients to subgroups for predicting treatment outcomes using microsimulation. Assignment of novel patients to a cluster of similar patients has the potential to improve prediction of patient outcomes for chronic conditions in which initial treatment response can be incorporated using microsimulation. Clinical trial registries www.clinicaltrials.gov: NCT00156078, NCT00159679, NCT00143156, NCT00553475.","PeriodicalId":20399,"journal":{"name":"Pragmatic and Observational Research","volume":"10 1","pages":"67 - 76"},"PeriodicalIF":2.3000,"publicationDate":"2019-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.2147/POR.S214412","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pragmatic and Observational Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2147/POR.S214412","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}

引用次数: 5

Abstract

Purpose Variability in patient treatment responses can be a barrier to effective care. Utilization of available patient databases may improve the prediction of treatment responses. We evaluated machine learning methods to predict novel, individual patient responses to pregabalin for painful diabetic peripheral neuropathy, utilizing an agent-based modeling and simulation platform that integrates real-world observational study (OS) data and randomized clinical trial (RCT) data. Patients and methods The best supervised machine learning methods were selected (through literature review) and combined in a novel way for aligning patients with relevant subgroups that best enable prediction of pregabalin responses. Data were derived from a German OS of pregabalin (N=2642) and nine international RCTs (N=1320). Coarsened exact matching of OS and RCT patients was used and a hierarchical cluster analysis was implemented. We tested which machine learning methods would best align candidate patients with specific clusters that predict their pain scores over time. Cluster alignments would trigger assignments of cluster-specific time-series regressions with lagged variables as inputs in order to simulate “virtual” patients and generate 1000 trajectory variations for given novel patients. Results Instance-based machine learning methods (k-nearest neighbor, supervised fuzzy c-means) were selected for quantitative analyses. Each method alone correctly classified 56.7% and 39.1% of patients, respectively. An “ensemble method” (combining both methods) correctly classified 98.4% and 95.9% of patients in the training and testing datasets, respectively. Conclusion An ensemble combination of two instance-based machine learning techniques best accommodated different data types (dichotomous, categorical, continuous) and performed better than either technique alone in assigning novel patients to subgroups for predicting treatment outcomes using microsimulation. Assignment of novel patients to a cluster of similar patients has the potential to improve prediction of patient outcomes for chronic conditions in which initial treatment response can be incorporated using microsimulation. Clinical trial registries www.clinicaltrials.gov: NCT00156078, NCT00159679, NCT00143156, NCT00553475.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

将机器学习与微模拟相结合，根据疼痛性糖尿病周围神经病变患者的观察和随机数据，对假设的新患者进行分类，预测普瑞巴林治疗反应

目的患者治疗反应的可变性可能成为有效治疗的障碍。利用现有的患者数据库可以改善对治疗反应的预测。我们利用基于代理的建模和仿真平台，整合了现实世界观察性研究(OS)数据和随机临床试验(RCT)数据，评估了机器学习方法，以预测新的个体患者对普瑞巴林治疗疼痛性糖尿病周围神经病变的反应。通过文献回顾，我们选择了最佳的监督机器学习方法，并以一种新颖的方式将患者与相关亚组相结合，从而最好地预测普瑞巴林的反应。数据来源于一项德国普瑞巴林OS (N=2642)和9项国际rct (N=1320)。对OS和RCT患者进行粗粒度精确匹配，并进行分层聚类分析。我们测试了哪种机器学习方法可以最好地将候选患者与预测其疼痛评分的特定集群结合起来。集群对齐将触发特定集群的时间序列回归分配，滞后变量作为输入，以模拟“虚拟”患者，并为给定的新患者生成1000个轨迹变化。结果选择基于实例的机器学习方法(k近邻、监督模糊c均值)进行定量分析。两种方法单独分类正确率分别为56.7%和39.1%。“集成方法”(结合两种方法)在训练和测试数据集中分别正确分类了98.4%和95.9%的患者。两种基于实例的机器学习技术的集成组合最好地适应不同的数据类型(二分类、分类、连续)，并且在将新患者分配到亚组以使用微模拟预测治疗结果方面优于单独使用任何一种技术。将新患者分配到一组相似的患者中有可能改善慢性疾病患者预后的预测，其中初始治疗反应可以使用微模拟纳入。临床试验注册中心www.clinicaltrials.gov: NCT00156078, NCT00159679, NCT00143156, NCT00553475。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Pragmatic and Observational Research MEDICINE, GENERAL & INTERNAL-

自引率

0.00%

发文量

期刊介绍： Pragmatic and Observational Research is an international, peer-reviewed, open-access journal that publishes data from studies designed to closely reflect medical interventions in real-world clinical practice, providing insights beyond classical randomized controlled trials (RCTs). While RCTs maximize internal validity for cause-and-effect relationships, they often represent only specific patient groups. This journal aims to complement such studies by providing data that better mirrors real-world patients and the usage of medicines, thus informing guidelines and enhancing the applicability of research findings across diverse patient populations encountered in everyday clinical practice.