L. Mazzeo , F. Corso , P. Baili , F. Scotti , V. Torri , M. Ganzinelli , V. Mišković , R. Leporati , L. Provenzano , A. Spagnoletti , C. Silvestri , C. Giani , C. Cavalli , R.M. di Mauro , M. Meazza Prina , C. Proto , M. Brambilla , M. Occhipinti , S. Manglaviti , T. Beninato , A. Prelaj
{"title":"Data analytics for real-world data integration in TKI-treated NSCLC patients using electronic health records","authors":"L. Mazzeo , F. Corso , P. Baili , F. Scotti , V. Torri , M. Ganzinelli , V. Mišković , R. Leporati , L. Provenzano , A. Spagnoletti , C. Silvestri , C. Giani , C. Cavalli , R.M. di Mauro , M. Meazza Prina , C. Proto , M. Brambilla , M. Occhipinti , S. Manglaviti , T. Beninato , A. Prelaj","doi":"10.1016/j.esmorw.2024.100109","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Real-world data (RWD) are routinely collected in clinical practice during therapeutic interventions. Data warehouses (DWHs) represent the primary source of RWD in which electronic health records (EHRs) can be rapidly analyzed via natural language processing. This study illustrates an analytic framework that systematically exploits RWD and methods to generate real-world evidence (RWE) about innovative cancer drugs. The framework has been applied to investigate real-world treatment patterns and clinical outcomes of patients with advanced non-small-cell lung cancer (aNSCLC) treated with tyrosine kinase inhibitors (TKIs).</div></div><div><h3>Materials and methods</h3><div>Data from a cohort of 190 epidermal growth factor receptor-positive mutation (EGFRm) patients with aNSCLC were retrospectively collected in an Italian cancer institute between 2014 and 2022. Patients were treated in first-line (1L) with osimertinib or other TKIs (non-osimertinib). A text-mining algorithm was implemented to retrieve RWD from EHRs. Survival endpoints were median time to treatment discontinuation (mTTD) and median overall survival (mOS) estimated with Kaplan–Meier curves. Time-dependent multivariate Cox analysis was carried out to overcome immortal time bias.</div></div><div><h3>Results</h3><div>Approximately 38% of patients received 1L osimertinib, while the remaining 62% received previous-generation TKIs. Longer mTTD [15 months; 95% confidence interval (CI) 11.9-26.4 months] was found for patients treated with 1L osimertinib compared with non-osimertinib (10 months; 95% CI 7.9-13.1 months). In multivariate analysis, osimertinib was an independent protective factor regardless of bone and brain metastases and local radiotherapy. mOS was 27 months (95% CI 21.4-39.5 months) for osimertinib versus 20.2 months (95% CI 17.6-23.1 months) for non-osimertinib.</div></div><div><h3>Conclusions</h3><div>Data analytics frameworks are useful tools to integrate RWE in cancer research and data-driven models are suitable to process large amounts of RWD. This study demonstrates that real-world treatment patterns and outcomes of TKIs are comparable with those found in both clinical trials and other real-world studies. RWE studies can support clinicians in investigating the best treatment strategy and decision makers to drive new health policies.</div></div>","PeriodicalId":100491,"journal":{"name":"ESMO Real World Data and Digital Oncology","volume":"7 ","pages":"Article 100109"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ESMO Real World Data and Digital Oncology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949820124000870","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/22 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Real-world data (RWD) are routinely collected in clinical practice during therapeutic interventions. Data warehouses (DWHs) represent the primary source of RWD in which electronic health records (EHRs) can be rapidly analyzed via natural language processing. This study illustrates an analytic framework that systematically exploits RWD and methods to generate real-world evidence (RWE) about innovative cancer drugs. The framework has been applied to investigate real-world treatment patterns and clinical outcomes of patients with advanced non-small-cell lung cancer (aNSCLC) treated with tyrosine kinase inhibitors (TKIs).
Materials and methods
Data from a cohort of 190 epidermal growth factor receptor-positive mutation (EGFRm) patients with aNSCLC were retrospectively collected in an Italian cancer institute between 2014 and 2022. Patients were treated in first-line (1L) with osimertinib or other TKIs (non-osimertinib). A text-mining algorithm was implemented to retrieve RWD from EHRs. Survival endpoints were median time to treatment discontinuation (mTTD) and median overall survival (mOS) estimated with Kaplan–Meier curves. Time-dependent multivariate Cox analysis was carried out to overcome immortal time bias.
Results
Approximately 38% of patients received 1L osimertinib, while the remaining 62% received previous-generation TKIs. Longer mTTD [15 months; 95% confidence interval (CI) 11.9-26.4 months] was found for patients treated with 1L osimertinib compared with non-osimertinib (10 months; 95% CI 7.9-13.1 months). In multivariate analysis, osimertinib was an independent protective factor regardless of bone and brain metastases and local radiotherapy. mOS was 27 months (95% CI 21.4-39.5 months) for osimertinib versus 20.2 months (95% CI 17.6-23.1 months) for non-osimertinib.
Conclusions
Data analytics frameworks are useful tools to integrate RWE in cancer research and data-driven models are suitable to process large amounts of RWD. This study demonstrates that real-world treatment patterns and outcomes of TKIs are comparable with those found in both clinical trials and other real-world studies. RWE studies can support clinicians in investigating the best treatment strategy and decision makers to drive new health policies.
现实世界数据(RWD)是在临床实践中常规收集的治疗干预。数据仓库(DWHs)是RWD的主要来源,其中电子健康记录(ehr)可以通过自然语言处理快速分析。本研究阐明了一个分析框架,该框架系统地利用RWD和方法来生成有关创新癌症药物的真实世界证据(RWE)。该框架已被用于研究使用酪氨酸激酶抑制剂(TKIs)治疗的晚期非小细胞肺癌(aNSCLC)患者的现实世界治疗模式和临床结果。材料和方法回顾性收集2014年至2022年意大利一家癌症研究所190例表皮生长因子受体阳性突变(EGFRm)的aNSCLC患者的数据。患者在一线(1L)接受奥西替尼或其他TKIs(非奥西替尼)治疗。采用文本挖掘算法从电子病历中检索RWD。生存终点是Kaplan-Meier曲线估计的中位停药时间(mTTD)和中位总生存期(mOS)。时间相关的多变量Cox分析克服了不朽的时间偏差。结果约38%的患者接受1L奥西替尼治疗,而其余62%的患者接受上一代TKIs治疗。更长的mTTD[15个月;95%可信区间(CI) 11.9-26.4个月],服用1L奥西替尼的患者与服用非奥西替尼的患者相比(10个月;95% CI 7.9-13.1个月)。在多因素分析中,奥西替尼是一个独立的保护因素,与骨和脑转移和局部放疗无关。奥西替尼组的mOS为27个月(95% CI 21.4-39.5个月),而非奥西替尼组的mOS为20.2个月(95% CI 17.6-23.1个月)。结论数据分析框架是将RWE整合到癌症研究中的有效工具,数据驱动模型适用于处理大量RWD。该研究表明,tki的现实世界治疗模式和结果与临床试验和其他现实世界研究中的结果具有可比性。RWE研究可以支持临床医生调查最佳治疗策略和决策者推动新的卫生政策。