首页 > 最新文献

JCO Clinical Cancer Informatics最新文献

英文 中文
Minimal Common Oncology Data Elements Genomics Pilot Project: Enhancing Oncology Research Through Electronic Health Record Interoperability at Vanderbilt University Medical Center. 最小通用肿瘤数据元素基因组学试点项目:范德比尔特大学医学中心通过电子健康记录互操作性加强肿瘤学研究。
IF 3.3 Q2 ONCOLOGY Pub Date : 2024-06-01 DOI: 10.1200/CCI.23.00249
Yanwei Li, Jiarong Ye, Yuxin Huang, Jiayi Wu, Xiaohan Liu, Shun Ahmed, Travis Osterman

Purpose: The expanding presence of the electronic health record (EHR) underscores the necessity for improved interoperability. To test the interoperability within the field of oncology research, our team at Vanderbilt University Medical Center (VUMC) enabled our Epic-based EHR to be compatible with the Minimal Common Oncology Data Elements (mCODE), which is a Fast Healthcare Interoperability Resources (FHIR)-based consensus data standard created to facilitate the transmission of EHRs for patients with cancer.

Methods: Our approach used an extract, transform, load tool for converting EHR data from the VUMC Epic Clarity database into mCODE-compatible profiles. We established a sandbox environment on Microsoft Azure for data migration, deployed a FHIR server to handle application programming interface (API) requests, and mapped VUMC data to align with mCODE structures. In addition, we constructed a web application to demonstrate the practical use of mCODE profiles in health care.

Results: We developed an end-to-end pipeline that converted EHR data into mCODE-compliant profiles, as well as a web application that visualizes genomic data and provides cancer risk assessments. Despite the complexities of aligning traditional EHR databases with mCODE standards and the limitations of FHIR APIs in supporting advanced statistical methodologies, this project successfully demonstrates the practical integration of mCODE standards into existing health care infrastructures.

Conclusion: This study provides a proof of concept for the interoperability of mCODE within a major health care institution's EHR system, highlighting both the potential and the current limitations of FHIR APIs in supporting complex data analysis for oncology research.

目的:电子病历(EHR)的应用范围不断扩大,凸显了提高互操作性的必要性。为了测试肿瘤学研究领域的互操作性,我们范德比尔特大学医学中心(VUMC)的团队使我们基于 Epic 的电子病历与最小通用肿瘤学数据元素(mCODE)兼容,后者是基于快速医疗互操作性资源(FHIR)的共识数据标准,旨在促进癌症患者电子病历的传输:我们的方法是使用一种提取、转换、加载工具,将 VUMC Epic Clarity 数据库中的电子病历数据转换为与 mCODE 兼容的配置文件。我们在 Microsoft Azure 上建立了一个用于数据迁移的沙盒环境,部署了一个 FHIR 服务器来处理应用编程接口(API)请求,并映射 VUMC 数据以与 mCODE 结构保持一致。此外,我们还构建了一个网络应用程序,以演示 mCODE 配置文件在医疗保健领域的实际应用:我们开发了一个端到端的管道,可将电子病历数据转换成符合 mCODE 标准的档案,还开发了一个网络应用程序,可将基因组数据可视化并提供癌症风险评估。尽管将传统的电子病历数据库与 mCODE 标准相匹配非常复杂,而且 FHIR API 在支持高级统计方法方面存在局限性,但该项目成功地展示了将 mCODE 标准实际整合到现有医疗基础设施中的可行性:本研究为 mCODE 在一家大型医疗机构的 EHR 系统中的互操作性提供了概念验证,突出了 FHIR API 在支持肿瘤研究复杂数据分析方面的潜力和当前局限性。
{"title":"Minimal Common Oncology Data Elements Genomics Pilot Project: Enhancing Oncology Research Through Electronic Health Record Interoperability at Vanderbilt University Medical Center.","authors":"Yanwei Li, Jiarong Ye, Yuxin Huang, Jiayi Wu, Xiaohan Liu, Shun Ahmed, Travis Osterman","doi":"10.1200/CCI.23.00249","DOIUrl":"10.1200/CCI.23.00249","url":null,"abstract":"<p><strong>Purpose: </strong>The expanding presence of the electronic health record (EHR) underscores the necessity for improved interoperability. To test the interoperability within the field of oncology research, our team at Vanderbilt University Medical Center (VUMC) enabled our Epic-based EHR to be compatible with the Minimal Common Oncology Data Elements (mCODE), which is a Fast Healthcare Interoperability Resources (FHIR)-based consensus data standard created to facilitate the transmission of EHRs for patients with cancer.</p><p><strong>Methods: </strong>Our approach used an extract, transform, load tool for converting EHR data from the VUMC Epic Clarity database into mCODE-compatible profiles. We established a sandbox environment on Microsoft Azure for data migration, deployed a FHIR server to handle application programming interface (API) requests, and mapped VUMC data to align with mCODE structures. In addition, we constructed a web application to demonstrate the practical use of mCODE profiles in health care.</p><p><strong>Results: </strong>We developed an end-to-end pipeline that converted EHR data into mCODE-compliant profiles, as well as a web application that visualizes genomic data and provides cancer risk assessments. Despite the complexities of aligning traditional EHR databases with mCODE standards and the limitations of FHIR APIs in supporting advanced statistical methodologies, this project successfully demonstrates the practical integration of mCODE standards into existing health care infrastructures.</p><p><strong>Conclusion: </strong>This study provides a proof of concept for the interoperability of mCODE within a major health care institution's EHR system, highlighting both the potential and the current limitations of FHIR APIs in supporting complex data analysis for oncology research.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"8 ","pages":"e2300249"},"PeriodicalIF":3.3,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11371088/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141472502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
When Will Real-World Data Fulfill Its Promise to Provide Timely Insights in Oncology? 真实世界数据何时才能兑现其在肿瘤学领域提供及时见解的承诺?
IF 3.3 Q2 ONCOLOGY Pub Date : 2024-06-01 DOI: 10.1200/CCI.24.00039
Marc L Berger, Patricia A Ganz, Kelly H Zou, Sheldon Greenfield

Randomized trials provide high-quality, internally consistent data on selected clinical questions, but lack generalizability for the aging population who are most often diagnosed with cancer and have comorbid conditions that may affect the interpretation of treatment benefit. The need for high-quality, relevant, and timely data is greater than ever. Promising solutions lie in the collection and analysis of real-world data (RWD), which can potentially provide timely insights about the patient's course during and after initial treatment and the outcomes of important subgroups such as the elderly, rural populations, children, and patients with greater social health needs. However, to inform practice and policy, real-world evidence must be created from trustworthy and comprehensive sources of RWD; these may include pragmatic clinical trials, registries, prospective observational studies, electronic health records (EHRs), administrative claims, and digital technologies. There are unique challenges in oncology since key parameters (eg, cancer stage, biomarker status, genomic assays, imaging response, side effects, quality of life) are not recorded, siloed in inaccessible documents, or available only as free text or unstructured reports in the EHR. Advances in analytics, such as artificial intelligence, may greatly enhance the ability to obtain more granular information from EHRs and support integrated diagnostics; however, they will need to be validated purpose by purpose. We recommend a commitment to standardizing data across sources and building infrastructures that can produce fit-for-purpose RWD that will provide timely understanding of the effectiveness of individual interventions.

随机试验为选定的临床问题提供了高质量、内部一致的数据,但对于经常被诊断为癌症并患有可能影响治疗效果解释的合并症的老龄人口来说,随机试验缺乏普遍性。现在比以往任何时候都更需要高质量、相关和及时的数据。有希望的解决方案在于收集和分析真实世界数据(RWD),这些数据有可能为患者在初始治疗期间和之后的治疗过程以及老年人、农村人口、儿童和有更多社会健康需求的患者等重要亚群的治疗结果提供及时的见解。然而,要为实践和政策提供信息,必须从可信的、全面的 RWD 来源中获取真实世界的证据;这些来源可能包括实用的临床试验、登记处、前瞻性观察研究、电子健康记录 (EHR)、行政索赔和数字技术。肿瘤学面临着独特的挑战,因为关键参数(如癌症分期、生物标记物状态、基因组检测、成像反应、副作用、生活质量)没有记录、被孤立在无法访问的文档中,或者只能以自由文本或非结构化报告的形式出现在电子病历中。人工智能等分析技术的进步可能会大大提高从电子病历中获取更精细信息的能力,并为综合诊断提供支持;但是,这些技术还需要逐项进行验证。我们建议致力于实现不同来源数据的标准化,并建立能够产生适合目的的 RWD 的基础设施,以便及时了解个别干预措施的有效性。
{"title":"When Will Real-World Data Fulfill Its Promise to Provide Timely Insights in Oncology?","authors":"Marc L Berger, Patricia A Ganz, Kelly H Zou, Sheldon Greenfield","doi":"10.1200/CCI.24.00039","DOIUrl":"https://doi.org/10.1200/CCI.24.00039","url":null,"abstract":"<p><p>Randomized trials provide high-quality, internally consistent data on selected clinical questions, but lack generalizability for the aging population who are most often diagnosed with cancer and have comorbid conditions that may affect the interpretation of treatment benefit. The need for high-quality, relevant, and timely data is greater than ever. Promising solutions lie in the collection and analysis of real-world data (RWD), which can potentially provide timely insights about the patient's course during and after initial treatment and the outcomes of important subgroups such as the elderly, rural populations, children, and patients with greater social health needs. However, to inform practice and policy, real-world evidence must be created from trustworthy and comprehensive sources of RWD; these may include pragmatic clinical trials, registries, prospective observational studies, electronic health records (EHRs), administrative claims, and digital technologies. There are unique challenges in oncology since key parameters (eg, cancer stage, biomarker status, genomic assays, imaging response, side effects, quality of life) are not recorded, siloed in inaccessible documents, or available only as free text or unstructured reports in the EHR. Advances in analytics, such as artificial intelligence, may greatly enhance the ability to obtain more granular information from EHRs and support integrated diagnostics; however, they will need to be validated purpose by purpose. We recommend a commitment to standardizing data across sources and building infrastructures that can produce fit-for-purpose RWD that will provide timely understanding of the effectiveness of individual interventions.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"8 ","pages":"e2400039"},"PeriodicalIF":3.3,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141477942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Population-Level Identification of Patients With Lynch Syndrome for Clinical Care, Quality Improvement, and Research. 为临床护理、质量改进和研究而进行的林奇综合征患者人群识别。
IF 4.2 Q2 ONCOLOGY Pub Date : 2024-06-01 DOI: 10.1200/CCI.23.00157
Ravi N Sharaf, Natalia Udaltsova, Dan Li, Rish K Pai, Soham Sinha, Zixuan Li, Douglas A Corley

Purpose: Identification of those at risk of hereditary cancer syndromes using electronic health record (EHR) data sources is important for clinical care, quality improvement, and research. We describe diagnostic processes, previously seldom reported, for a common hereditary cancer syndrome, Lynch syndrome (LS), using EHR data within a community-based, multicenter, demographically diverse health system.

Methods: Within a retrospective cohort enrolled between 2015 and 2020 at Kaiser Permanente Northern California, we assessed electronic diagnostic domains for LS including (1) family history of LS-associated cancer; (2) personal history of LS-associated cancer; (3) LS screening via mismatch repair deficiency (MMRD) testing of newly diagnosed malignancy; (4) germline genetic test results; and (5) clinician-entered diagnostic codes for LS. We calculated proportions and overlap for each diagnostic domain descriptively.

Results: Among 5.8 million individuals, (1) 28,492 (0.49%) had a family history of LS-associated cancer of whom 3,635 (13%) underwent genetic testing; (2) 100,046 (1.7%) had a personal history of a LS-associated cancer; and (3) 8,711 (0.1%) were diagnosed with colorectal cancer of whom 7,533 (86%) underwent MMRD screening and of the positive screens (486), 130 (27%) underwent germline testing. One thousand seven hundred and fifty-seven (0.03%) were diagnosed with endometrial cancer of whom 1,613 (92%) underwent MMRD screening and of the 195 who screened positive, 55 (28%) underwent genetic testing. (4) 30,790 (0.05%) had LS germline genetic testing with 707 (0.01%) testing positive; and (5) 1,273 (0.02%) had a clinician-entered diagnosis of LS.

Conclusion: It is feasible to electronically characterize the diagnostic processes of LS. No single data source comprehensively identifies all LS carriers. There is underutilization of LS genetic testing for those eligible and underdiagnosis of LS. Our work informs similar efforts in other settings for hereditary cancer syndromes.

目的:利用电子健康记录(EHR)数据源识别遗传性癌症综合征高危人群对于临床护理、质量改进和研究非常重要。我们描述了一种常见的遗传性癌症综合征--林奇综合征(Lynch syndrome,LS)的诊断过程,该诊断过程以前很少报道,我们是在一个基于社区、多中心、人口统计学多样化的医疗系统中使用电子病历数据进行诊断的:在北加州凯泽医疗集团(Kaiser Permanente Northern California)2015 年至 2020 年间登记的回顾性队列中,我们评估了林奇综合征的电子诊断域,包括:(1)林奇综合征相关癌症的家族史;(2)林奇综合征相关癌症的个人史;(3)通过新诊断恶性肿瘤的错配修复缺陷(MMRD)检测进行的林奇综合征筛查;(4)种系遗传检测结果;以及(5)临床医生输入的林奇综合征诊断代码。我们对每个诊断领域的比例和重叠情况进行了描述性计算:在 580 万人中,(1) 28,492 人(0.49%)有 LS 相关癌症家族史,其中 3,635 人(13%)接受了基因检测;(2) 100,046 人(1.7%)有 LS 相关癌症个人史;(3) 8,711 人(0.1%)被诊断出患有结直肠癌,其中 7533 人(86%)接受了 MMRD 筛查,在阳性筛查结果(486 人)中,130 人(27%)接受了种系检测。有 1757 人(0.03%)被确诊为子宫内膜癌,其中 1613 人(92%)接受了 MMRD 筛查,在筛查结果呈阳性的 195 人中,55 人(28%)接受了基因检测。(4) 30790人(0.05%)进行了LS种系基因检测,其中707人(0.01%)检测结果呈阳性;(5) 1273人(0.02%)经临床医生诊断为LS:结论:以电子方式描述LS的诊断过程是可行的。没有一个数据源能全面识别所有 LS 携带者。对于符合条件的人来说,LS基因检测的利用率不足,LS的诊断率也不足。我们的工作为其他环境下的遗传性癌症综合征类似工作提供了参考。
{"title":"Population-Level Identification of Patients With Lynch Syndrome for Clinical Care, Quality Improvement, and Research.","authors":"Ravi N Sharaf, Natalia Udaltsova, Dan Li, Rish K Pai, Soham Sinha, Zixuan Li, Douglas A Corley","doi":"10.1200/CCI.23.00157","DOIUrl":"https://doi.org/10.1200/CCI.23.00157","url":null,"abstract":"<p><strong>Purpose: </strong>Identification of those at risk of hereditary cancer syndromes using electronic health record (EHR) data sources is important for clinical care, quality improvement, and research. We describe diagnostic processes, previously seldom reported, for a common hereditary cancer syndrome, Lynch syndrome (LS), using EHR data within a community-based, multicenter, demographically diverse health system.</p><p><strong>Methods: </strong>Within a retrospective cohort enrolled between 2015 and 2020 at Kaiser Permanente Northern California, we assessed electronic diagnostic domains for LS including (1) family history of LS-associated cancer; (2) personal history of LS-associated cancer; (3) LS screening via mismatch repair deficiency (MMRD) testing of newly diagnosed malignancy; (4) germline genetic test results; and (5) clinician-entered diagnostic codes for LS. We calculated proportions and overlap for each diagnostic domain descriptively.</p><p><strong>Results: </strong>Among 5.8 million individuals, (1) 28,492 (0.49%) had a family history of LS-associated cancer of whom 3,635 (13%) underwent genetic testing; (2) 100,046 (1.7%) had a personal history of a LS-associated cancer; and (3) 8,711 (0.1%) were diagnosed with colorectal cancer of whom 7,533 (86%) underwent MMRD screening and of the positive screens (486), 130 (27%) underwent germline testing. One thousand seven hundred and fifty-seven (0.03%) were diagnosed with endometrial cancer of whom 1,613 (92%) underwent MMRD screening and of the 195 who screened positive, 55 (28%) underwent genetic testing. (4) 30,790 (0.05%) had LS germline genetic testing with 707 (0.01%) testing positive; and (5) 1,273 (0.02%) had a clinician-entered diagnosis of LS.</p><p><strong>Conclusion: </strong>It is feasible to electronically characterize the diagnostic processes of LS. No single data source comprehensively identifies all LS carriers. There is underutilization of LS genetic testing for those eligible and underdiagnosis of LS. Our work informs similar efforts in other settings for hereditary cancer syndromes.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"8 ","pages":"e2300157"},"PeriodicalIF":4.2,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141262671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving Clinical Registry Data Quality via Linkage With Survival Data From State-Based Population Registries. 通过与州人口登记处的生存数据建立联系,提高临床登记数据质量。
IF 3.3 Q2 ONCOLOGY Pub Date : 2024-06-01 DOI: 10.1200/CCI.24.00025
Samuel Smith, Kate Drummond, Anthony Dowling, Iwan Bennett, David Campbell, Ronnie Freilich, Claire Phillips, Elizabeth Ahern, Simone Reeves, Robert Campbell, Ian M Collins, Julie Johns, Megan Dumas, Wei Hong, Peter Gibbs, Lucy Gately

Purpose: Real-world data (RWD) collected on patients treated as part of routine clinical care form the basis of cancer clinical registries. Capturing accurate death data can be challenging, with inaccurate survival data potentially compromising the integrity of registry-based research. Here, we explore the utility of data linkage (DL) to state-based registries to enhance the capture of survival outcomes.

Methods: We identified consecutive adult patients with brain tumors treated in the state of Victoria from the Brain Tumour Registry Australia: Innovation and Translation (BRAIN) database, who had no recorded date of death and no follow-up within the last 6 months. Full name and date of birth were used to match patients in the BRAIN registry with those in the Victorian Births, Deaths and Marriages (BDM) registry. Overall survival (OS) outcomes were compared pre- and post-DL.

Results: Of the 7,346 clinical registry patients, 5,462 (74%) had no date of death and no follow-up recorded within the last 6 months. Of the 5,462 patients, 1,588 (29%) were matched with a date of death in BDM. Factors associated with an increased number of matches were poor prognosis tumors, older age, and social disadvantage. OS was significantly overestimated pre-DL compared with post-DL for the entire cohort (pre- v post-DL: hazard ratio, 1.43; P < .001; median, 29.9 months v 16.7 months) and for most individual tumor types. This finding was present independent of the tumor prognosis.

Conclusion: As revealed by linkage with BDM, a high proportion of patients in a brain cancer clinical registry had missing death data, contributed to by informative censoring, inflating OS calculations. DL to pertinent registries on an ongoing basis should be considered to ensure accurate reporting of survival data and interpretation of RWD outcomes.

目的:在常规临床治疗过程中收集的患者真实世界数据(RWD)是癌症临床登记的基础。获取准确的死亡数据具有挑战性,不准确的生存数据可能会损害基于登记册的研究的完整性。在此,我们探讨了将数据链接(DL)到基于州的登记处以加强生存结果采集的实用性:方法:我们从澳大利亚脑肿瘤登记处(Brain Tumour Registry Australia:方法:我们从澳大利亚脑肿瘤登记:创新与转化(BRAIN)数据库中筛选出在维多利亚州接受治疗的连续成年脑肿瘤患者,这些患者在过去 6 个月内没有死亡日期记录,也没有随访记录。全名和出生日期用于将 BRAIN 登记中的患者与维多利亚州出生、死亡和婚姻(BDM)登记中的患者进行匹配。比较了DL前后的总生存期(OS)结果:在 7,346 名临床登记患者中,有 5,462 人(74%)在过去 6 个月内没有死亡日期和随访记录。在这 5462 名患者中,有 1588 人(29%)与 BDM 中的死亡日期相匹配。与匹配人数增加相关的因素包括预后不良的肿瘤、年龄偏大和社会处境不利。在整个队列中,DL前的OS明显高于DL后的OS(DL前与DL后:危险比,1.43;P < .001;中位数,29.9个月与16.7个月),大多数单个肿瘤类型的OS也是如此。这一发现与肿瘤预后无关:结论:通过与 BDM 的连接发现,脑癌临床登记处有很大一部分患者的死亡数据缺失,而这是信息性普查造成的,从而提高了 OS 的计算结果。应考虑持续与相关登记处进行 DL 连接,以确保准确报告生存数据和解释 RWD 结果。
{"title":"Improving Clinical Registry Data Quality via Linkage With Survival Data From State-Based Population Registries.","authors":"Samuel Smith, Kate Drummond, Anthony Dowling, Iwan Bennett, David Campbell, Ronnie Freilich, Claire Phillips, Elizabeth Ahern, Simone Reeves, Robert Campbell, Ian M Collins, Julie Johns, Megan Dumas, Wei Hong, Peter Gibbs, Lucy Gately","doi":"10.1200/CCI.24.00025","DOIUrl":"https://doi.org/10.1200/CCI.24.00025","url":null,"abstract":"<p><strong>Purpose: </strong>Real-world data (RWD) collected on patients treated as part of routine clinical care form the basis of cancer clinical registries. Capturing accurate death data can be challenging, with inaccurate survival data potentially compromising the integrity of registry-based research. Here, we explore the utility of data linkage (DL) to state-based registries to enhance the capture of survival outcomes.</p><p><strong>Methods: </strong>We identified consecutive adult patients with brain tumors treated in the state of Victoria from the Brain Tumour Registry Australia: Innovation and Translation (BRAIN) database, who had no recorded date of death and no follow-up within the last 6 months. Full name and date of birth were used to match patients in the BRAIN registry with those in the Victorian Births, Deaths and Marriages (BDM) registry. Overall survival (OS) outcomes were compared pre- and post-DL.</p><p><strong>Results: </strong>Of the 7,346 clinical registry patients, 5,462 (74%) had no date of death and no follow-up recorded within the last 6 months. Of the 5,462 patients, 1,588 (29%) were matched with a date of death in BDM. Factors associated with an increased number of matches were poor prognosis tumors, older age, and social disadvantage. OS was significantly overestimated pre-DL compared with post-DL for the entire cohort (pre- <i>v</i> post-DL: hazard ratio, 1.43; <i>P</i> < .001; median, 29.9 months <i>v</i> 16.7 months) and for most individual tumor types. This finding was present independent of the tumor prognosis.</p><p><strong>Conclusion: </strong>As revealed by linkage with BDM, a high proportion of patients in a brain cancer clinical registry had missing death data, contributed to by informative censoring, inflating OS calculations. DL to pertinent registries on an ongoing basis should be considered to ensure accurate reporting of survival data and interpretation of RWD outcomes.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"8 ","pages":"e2400025"},"PeriodicalIF":3.3,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141460687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MOSAIC: An Artificial Intelligence-Based Framework for Multimodal Analysis, Classification, and Personalized Prognostic Assessment in Rare Cancers. MOSAIC:基于人工智能的罕见癌症多模式分析、分类和个性化预后评估框架。
IF 3.3 Q2 ONCOLOGY Pub Date : 2024-06-01 DOI: 10.1200/CCI.24.00008
Saverio D'Amico, Lorenzo Dall'Olio, Cesare Rollo, Patricia Alonso, Iñigo Prada-Luengo, Daniele Dall'Olio, Claudia Sala, Elisabetta Sauta, Gianluca Asti, Luca Lanino, Giulia Maggioni, Alessia Campagna, Elena Zazzetti, Mattia Delleani, Maria Elena Bicchieri, Pierandrea Morandini, Victor Savevski, Borja Arroyo, Juan Parras, Lin Pierre Zhao, Uwe Platzbecker, Maria Diez-Campelo, Valeria Santini, Pierre Fenaux, Torsten Haferlach, Anders Krogh, Santiago Zazo, Piero Fariselli, Tiziana Sanavia, Matteo Giovanni Della Porta, Gastone Castellani

Purpose: Rare cancers constitute over 20% of human neoplasms, often affecting patients with unmet medical needs. The development of effective classification and prognostication systems is crucial to improve the decision-making process and drive innovative treatment strategies. We have created and implemented MOSAIC, an artificial intelligence (AI)-based framework designed for multimodal analysis, classification, and personalized prognostic assessment in rare cancers. Clinical validation was performed on myelodysplastic syndrome (MDS), a rare hematologic cancer with clinical and genomic heterogeneities.

Methods: We analyzed 4,427 patients with MDS divided into training and validation cohorts. Deep learning methods were applied to integrate and impute clinical/genomic features. Clustering was performed by combining Uniform Manifold Approximation and Projection for Dimension Reduction + Hierarchical Density-Based Spatial Clustering of Applications with Noise (UMAP + HDBSCAN) methods, compared with the conventional Hierarchical Dirichlet Process (HDP). Linear and AI-based nonlinear approaches were compared for survival prediction. Explainable AI (Shapley Additive Explanations approach [SHAP]) and federated learning were used to improve the interpretation and the performance of the clinical models, integrating them into distributed infrastructure.

Results: UMAP + HDBSCAN clustering obtained a more granular patient stratification, achieving a higher average silhouette coefficient (0.16) with respect to HDP (0.01) and higher balanced accuracy in cluster classification by Random Forest (92.7% ± 1.3% and 85.8% ± 0.8%). AI methods for survival prediction outperform conventional statistical techniques and the reference prognostic tool for MDS. Nonlinear Gradient Boosting Survival stands in the internal (Concordance-Index [C-Index], 0.77; SD, 0.01) and external validation (C-Index, 0.74; SD, 0.02). SHAP analysis revealed that similar features drove patients' subgroups and outcomes in both training and validation cohorts. Federated implementation improved the accuracy of developed models.

Conclusion: MOSAIC provides an explainable and robust framework to optimize classification and prognostic assessment of rare cancers. AI-based approaches demonstrated superior accuracy in capturing genomic similarities and providing individual prognostic information compared with conventional statistical methods. Its federated implementation ensures broad clinical application, guaranteeing high performance and data protection.

目的:罕见癌症占人类肿瘤的 20% 以上,通常会影响到尚未满足医疗需求的患者。开发有效的分类和预后系统对于改善决策过程和推动创新治疗策略至关重要。我们创建并实施了 MOSAIC,这是一个基于人工智能(AI)的框架,旨在对罕见癌症进行多模态分析、分类和个性化预后评估。骨髓增生异常综合征(MDS)是一种罕见的血液肿瘤,具有临床和基因组异质性,我们对其进行了临床验证:我们对 4427 名 MDS 患者进行了分析,将其分为训练组和验证组。应用深度学习方法整合和估算临床/基因组特征。与传统的分层迪里希勒过程(HDP)相比,聚类是通过结合统一表层逼近和投影降维+基于密度的分层空间聚类(UMAP + HDBSCAN)方法进行的。在生存预测方面,对线性方法和基于人工智能的非线性方法进行了比较。可解释人工智能(Shapley Additive Explanations approach [SHAP])和联合学习被用来改进临床模型的解释和性能,并将它们集成到分布式基础设施中:UMAP+HDBSCAN聚类方法获得了更精细的患者分层,与HDP(0.01)相比,平均剪影系数(0.16)更高,随机森林(92.7%±1.3%)和85.8%±0.8%)聚类分类的均衡准确率更高。人工智能生存预测方法优于传统统计技术和 MDS 的参考预后工具。非线性梯度提升生存率在内部验证(Concordance-Index [C-Index],0.77;SD,0.01)和外部验证(C-Index,0.74;SD,0.02)中均名列前茅。SHAP分析表明,在训练组和验证组中,类似的特征驱动着患者的亚组和结果。联合实施提高了所开发模型的准确性:MOSAIC为优化罕见癌症的分类和预后评估提供了一个可解释且稳健的框架。与传统统计方法相比,基于人工智能的方法在捕捉基因组相似性和提供个体预后信息方面表现出更高的准确性。其联合实施确保了广泛的临床应用,保证了高性能和数据保护。
{"title":"MOSAIC: An Artificial Intelligence-Based Framework for Multimodal Analysis, Classification, and Personalized Prognostic Assessment in Rare Cancers.","authors":"Saverio D'Amico, Lorenzo Dall'Olio, Cesare Rollo, Patricia Alonso, Iñigo Prada-Luengo, Daniele Dall'Olio, Claudia Sala, Elisabetta Sauta, Gianluca Asti, Luca Lanino, Giulia Maggioni, Alessia Campagna, Elena Zazzetti, Mattia Delleani, Maria Elena Bicchieri, Pierandrea Morandini, Victor Savevski, Borja Arroyo, Juan Parras, Lin Pierre Zhao, Uwe Platzbecker, Maria Diez-Campelo, Valeria Santini, Pierre Fenaux, Torsten Haferlach, Anders Krogh, Santiago Zazo, Piero Fariselli, Tiziana Sanavia, Matteo Giovanni Della Porta, Gastone Castellani","doi":"10.1200/CCI.24.00008","DOIUrl":"10.1200/CCI.24.00008","url":null,"abstract":"<p><strong>Purpose: </strong>Rare cancers constitute over 20% of human neoplasms, often affecting patients with unmet medical needs. The development of effective classification and prognostication systems is crucial to improve the decision-making process and drive innovative treatment strategies. We have created and implemented MOSAIC, an artificial intelligence (AI)-based framework designed for multimodal analysis, classification, and personalized prognostic assessment in rare cancers. Clinical validation was performed on myelodysplastic syndrome (MDS), a rare hematologic cancer with clinical and genomic heterogeneities.</p><p><strong>Methods: </strong>We analyzed 4,427 patients with MDS divided into training and validation cohorts. Deep learning methods were applied to integrate and impute clinical/genomic features. Clustering was performed by combining Uniform Manifold Approximation and Projection for Dimension Reduction + Hierarchical Density-Based Spatial Clustering of Applications with Noise (UMAP + HDBSCAN) methods, compared with the conventional Hierarchical Dirichlet Process (HDP). Linear and AI-based nonlinear approaches were compared for survival prediction. Explainable AI (Shapley Additive Explanations approach [SHAP]) and federated learning were used to improve the interpretation and the performance of the clinical models, integrating them into distributed infrastructure.</p><p><strong>Results: </strong>UMAP + HDBSCAN clustering obtained a more granular patient stratification, achieving a higher average silhouette coefficient (0.16) with respect to HDP (0.01) and higher balanced accuracy in cluster classification by Random Forest (92.7% ± 1.3% and 85.8% ± 0.8%). AI methods for survival prediction outperform conventional statistical techniques and the reference prognostic tool for MDS. Nonlinear Gradient Boosting Survival stands in the internal (Concordance-Index [C-Index], 0.77; SD, 0.01) and external validation (C-Index, 0.74; SD, 0.02). SHAP analysis revealed that similar features drove patients' subgroups and outcomes in both training and validation cohorts. Federated implementation improved the accuracy of developed models.</p><p><strong>Conclusion: </strong>MOSAIC provides an explainable and robust framework to optimize classification and prognostic assessment of rare cancers. AI-based approaches demonstrated superior accuracy in capturing genomic similarities and providing individual prognostic information compared with conventional statistical methods. Its federated implementation ensures broad clinical application, guaranteeing high performance and data protection.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"8 ","pages":"e2400008"},"PeriodicalIF":3.3,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11371092/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141321999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Erratum: Phenotyping Hepatic Immune-Related Adverse Events in the Setting of Immune Checkpoint Inhibitor Therapy. 勘误:免疫检查点抑制剂治疗过程中肝脏免疫相关不良事件的表型分析
IF 3.3 Q2 ONCOLOGY Pub Date : 2024-06-01 DOI: 10.1200/CCI.24.00125
{"title":"Erratum: Phenotyping Hepatic Immune-Related Adverse Events in the Setting of Immune Checkpoint Inhibitor Therapy.","authors":"","doi":"10.1200/CCI.24.00125","DOIUrl":"https://doi.org/10.1200/CCI.24.00125","url":null,"abstract":"","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"8 ","pages":"e2400125"},"PeriodicalIF":3.3,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141443743","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Addressing the Historic Challenges of BRCA1 and BRCA2 Variant Curation and the Need for More Diverse Representation. 应对 BRCA1 和 BRCA2 变体保存的历史性挑战,以及更多元化代表的需求。
IF 3.3 Q2 ONCOLOGY Pub Date : 2024-06-01 DOI: 10.1200/CCI.24.00076
Yanin Chavarri-Guerra, Jeffrey N Weitzel

Advancements in variant curation challenges: minority representation and incomplete data reporting.

推进变体整理的挑战:少数群体的代表性和不完整的数据报告。
{"title":"Addressing the Historic Challenges of <i>BRCA1</i> and <i>BRCA2</i> Variant Curation and the Need for More Diverse Representation.","authors":"Yanin Chavarri-Guerra, Jeffrey N Weitzel","doi":"10.1200/CCI.24.00076","DOIUrl":"10.1200/CCI.24.00076","url":null,"abstract":"<p><p>Advancements in variant curation challenges: minority representation and incomplete data reporting.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"8 ","pages":"e2400076"},"PeriodicalIF":3.3,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141307387","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Future of Cancer Treatment Guidelines: Integrating Real-World Insights for Equitable Cancer Care. 癌症治疗指南的未来:整合现实世界的洞察力,实现公平的癌症护理。
IF 3.3 Q2 ONCOLOGY Pub Date : 2024-06-01 DOI: 10.1200/CCI.24.00081
Rebecca A Miksad, Gregory S Calip
{"title":"Future of Cancer Treatment Guidelines: Integrating Real-World Insights for Equitable Cancer Care.","authors":"Rebecca A Miksad, Gregory S Calip","doi":"10.1200/CCI.24.00081","DOIUrl":"https://doi.org/10.1200/CCI.24.00081","url":null,"abstract":"","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"8 ","pages":"e2400081"},"PeriodicalIF":3.3,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141499587","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extracting Systemic Anticancer Therapy and Response Information From Clinical Notes Following the RECIST Definition. 根据 RECIST 定义从临床笔记中提取系统抗癌疗法和反应信息。
IF 4.2 Q2 ONCOLOGY Pub Date : 2024-06-01 DOI: 10.1200/CCI.23.00166
Xu Zuo, Ashok Kumar, Shuhan Shen, Jianfu Li, Grace Cong, Edward Jin, Qingxia Chen, Jeremy L Warner, Ping Yang, Hua Xu

Purpose: The RECIST guidelines provide a standardized approach for evaluating the response of cancer to treatment, allowing for consistent comparison of treatment efficacy across different therapies and patients. However, collecting such information from electronic health records manually can be extremely labor-intensive and time-consuming because of the complexity and volume of clinical notes. The aim of this study is to apply natural language processing (NLP) techniques to automate this process, minimizing manual data collection efforts, and improving the consistency and reliability of the results.

Methods: We proposed a complex, hybrid NLP system that automates the process of extracting, linking, and summarizing anticancer therapy and associated RECIST-like responses from narrative clinical text. The system consists of multiple machine learning-/deep learning-based and rule-based modules for diverse NLP tasks such as named entity recognition, assertion classification, relation extraction, and text normalization, to address different challenges associated with anticancer therapy and response information extraction. We then evaluated the system performances on two independent test sets from different institutions to demonstrate its effectiveness and generalizability.

Results: The system used domain-specific language models, BioBERT and BioClinicalBERT, for high-performance therapy mentions identification and RECIST responses extraction and categorization. The best-performing model achieved a 0.66 score in linking therapy and RECIST response mentions, with end-to-end performance peaking at 0.74 after relation normalization, indicating substantial efficacy with room for improvement.

Conclusion: We developed, implemented, and tested an information extraction system from clinical notes for cancer treatment and efficacy assessment information. We expect this system will support future cancer research, particularly oncologic studies that focus on efficiently assessing the effectiveness and reliability of cancer therapeutics.

目的:RECIST 指南为评估癌症对治疗的反应提供了一种标准化方法,可以对不同疗法和患者的治疗效果进行一致的比较。然而,由于临床记录的复杂性和数量,从电子健康记录中手动收集此类信息可能会非常耗费人力和时间。本研究旨在应用自然语言处理(NLP)技术实现这一过程的自动化,最大限度地减少人工数据收集工作,并提高结果的一致性和可靠性:我们提出了一种复杂的混合 NLP 系统,该系统可自动从临床叙事文本中提取、链接和总结抗癌疗法及相关的 RECIST 类反应。该系统由多个基于机器学习/深度学习的模块和基于规则的模块组成,可完成命名实体识别、断言分类、关系提取和文本规范化等多种 NLP 任务,以应对与抗癌疗法和反应信息提取相关的不同挑战。然后,我们对来自不同机构的两个独立测试集进行了系统性能评估,以证明其有效性和通用性:结果:该系统使用了特定领域的语言模型 BioBERT 和 BioClinicalBERT,用于高性能的治疗提法识别和 RECIST 反应提取与分类。表现最好的模型在连接疗法和 RECIST 反应提及方面的得分达到了 0.66,在关系规范化后,端到端的表现达到了 0.74 的峰值,这表明该系统具有很高的效率,但仍有改进的余地:我们开发、实施并测试了一个从临床笔记中提取癌症治疗和疗效评估信息的系统。我们希望该系统能为未来的癌症研究提供支持,尤其是那些专注于高效评估癌症治疗有效性和可靠性的肿瘤研究。
{"title":"Extracting Systemic Anticancer Therapy and Response Information From Clinical Notes Following the RECIST Definition.","authors":"Xu Zuo, Ashok Kumar, Shuhan Shen, Jianfu Li, Grace Cong, Edward Jin, Qingxia Chen, Jeremy L Warner, Ping Yang, Hua Xu","doi":"10.1200/CCI.23.00166","DOIUrl":"https://doi.org/10.1200/CCI.23.00166","url":null,"abstract":"<p><strong>Purpose: </strong>The RECIST guidelines provide a standardized approach for evaluating the response of cancer to treatment, allowing for consistent comparison of treatment efficacy across different therapies and patients. However, collecting such information from electronic health records manually can be extremely labor-intensive and time-consuming because of the complexity and volume of clinical notes. The aim of this study is to apply natural language processing (NLP) techniques to automate this process, minimizing manual data collection efforts, and improving the consistency and reliability of the results.</p><p><strong>Methods: </strong>We proposed a complex, hybrid NLP system that automates the process of extracting, linking, and summarizing anticancer therapy and associated RECIST-like responses from narrative clinical text. The system consists of multiple machine learning-/deep learning-based and rule-based modules for diverse NLP tasks such as named entity recognition, assertion classification, relation extraction, and text normalization, to address different challenges associated with anticancer therapy and response information extraction. We then evaluated the system performances on two independent test sets from different institutions to demonstrate its effectiveness and generalizability.</p><p><strong>Results: </strong>The system used domain-specific language models, BioBERT and BioClinicalBERT, for high-performance therapy mentions identification and RECIST responses extraction and categorization. The best-performing model achieved a 0.66 score in linking therapy and RECIST response mentions, with end-to-end performance peaking at 0.74 after relation normalization, indicating substantial efficacy with room for improvement.</p><p><strong>Conclusion: </strong>We developed, implemented, and tested an information extraction system from clinical notes for cancer treatment and efficacy assessment information. We expect this system will support future cancer research, particularly oncologic studies that focus on efficiently assessing the effectiveness and reliability of cancer therapeutics.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"8 ","pages":"e2300166"},"PeriodicalIF":4.2,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141421853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extraction of Unstructured Electronic Health Records to Evaluate Glioblastoma Treatment Patterns. 提取非结构化电子健康记录,评估胶质母细胞瘤治疗模式。
IF 3.3 Q2 ONCOLOGY Pub Date : 2024-06-01 DOI: 10.1200/CCI.23.00091
Akshay Swaminathan, Alexander L Ren, Janet Y Wu, Aarohi Bhargava-Shah, Ivan Lopez, Ujwal Srivastava, Vassilis Alexopoulos, Rebecca Pizzitola, Brandon Bui, Layth Alkhani, Susan Lee, Nathan Mohit, Noel Seo, Nicholas Macedo, Winson Cheng, William Wang, Edward Tran, Reena Thomas, Olivier Gevaert

Purpose: Data on lines of therapy (LOTs) for cancer treatment are important for clinical oncology research, but LOTs are not explicitly recorded in electronic health records (EHRs). We present an efficient approach for clinical data abstraction and a flexible algorithm to derive LOTs from EHR-based medication data on patients with glioblastoma multiforme (GBM).

Methods: Nonclinicians were trained to abstract the diagnosis of GBM from EHRs, and their accuracy was compared with abstraction performed by clinicians. The resulting data were used to build a cohort of patients with confirmed GBM diagnosis. An algorithm was developed to derive LOTs using structured medication data, accounting for the addition and discontinuation of therapies and drug class. Descriptive statistics were calculated and time-to-next-treatment (TTNT) analysis was performed using the Kaplan-Meier method.

Results: Treating clinicians as the gold standard, nonclinicians abstracted GBM diagnosis with a sensitivity of 0.98, specificity 1.00, positive predictive value 1.00, and negative predictive value 0.90, suggesting that nonclinician abstraction of GBM diagnosis was comparable with clinician abstraction. Of 693 patients with a confirmed diagnosis of GBM, 246 patients contained structured information about the types of medications received. Of them, 165 (67.1%) received a first-line therapy (1L) of temozolomide, and the median TTNT from the start of 1L was 179 days.

Conclusion: We described a workflow for extracting diagnosis of GBM and LOT from EHR data that combines nonclinician abstraction with algorithmic processing, demonstrating comparable accuracy with clinician abstraction and highlighting the potential for scalable and efficient EHR-based oncology research.

目的:癌症治疗线(LOT)数据对于临床肿瘤学研究非常重要,但电子健康记录(EHR)中并没有明确记录治疗线。我们提出了一种高效的临床数据抽取方法和一种灵活的算法,可从基于电子病历的多形性胶质母细胞瘤(GBM)患者用药数据中得出治疗方案:方法: 培训非临床医生从电子病历中抽取 GBM 诊断数据,并将其准确性与临床医生抽取数据的准确性进行比较。所得数据用于建立确诊为 GBM 的患者队列。我们开发了一种算法,利用结构化的用药数据推导出 LOTs,并考虑到疗法和药物类别的增加和中断。计算了描述性统计数字,并采用卡普兰-梅耶法进行了下次治疗时间(TTNT)分析:将临床医生作为金标准,非临床医生抽取 GBM 诊断的敏感性为 0.98,特异性为 1.00,阳性预测值为 1.00,阴性预测值为 0.90,表明非临床医生抽取 GBM 诊断与临床医生抽取相当。在 693 名确诊为 GBM 的患者中,有 246 名患者提供了有关所接受药物类型的结构化信息。其中,165 人(67.1%)接受了替莫唑胺一线治疗(1L),从 1L 开始的中位 TTNT 为 179 天:我们描述了一种从电子病历数据中提取 GBM 和 LOT 诊断的工作流程,该流程将非临床医生抽取与算法处理相结合,显示出与临床医生抽取相当的准确性,并突出了基于电子病历的可扩展、高效的肿瘤学研究的潜力。
{"title":"Extraction of Unstructured Electronic Health Records to Evaluate Glioblastoma Treatment Patterns.","authors":"Akshay Swaminathan, Alexander L Ren, Janet Y Wu, Aarohi Bhargava-Shah, Ivan Lopez, Ujwal Srivastava, Vassilis Alexopoulos, Rebecca Pizzitola, Brandon Bui, Layth Alkhani, Susan Lee, Nathan Mohit, Noel Seo, Nicholas Macedo, Winson Cheng, William Wang, Edward Tran, Reena Thomas, Olivier Gevaert","doi":"10.1200/CCI.23.00091","DOIUrl":"10.1200/CCI.23.00091","url":null,"abstract":"<p><strong>Purpose: </strong>Data on lines of therapy (LOTs) for cancer treatment are important for clinical oncology research, but LOTs are not explicitly recorded in electronic health records (EHRs). We present an efficient approach for clinical data abstraction and a flexible algorithm to derive LOTs from EHR-based medication data on patients with glioblastoma multiforme (GBM).</p><p><strong>Methods: </strong>Nonclinicians were trained to abstract the diagnosis of GBM from EHRs, and their accuracy was compared with abstraction performed by clinicians. The resulting data were used to build a cohort of patients with confirmed GBM diagnosis. An algorithm was developed to derive LOTs using structured medication data, accounting for the addition and discontinuation of therapies and drug class. Descriptive statistics were calculated and time-to-next-treatment (TTNT) analysis was performed using the Kaplan-Meier method.</p><p><strong>Results: </strong>Treating clinicians as the gold standard, nonclinicians abstracted GBM diagnosis with a sensitivity of 0.98, specificity 1.00, positive predictive value 1.00, and negative predictive value 0.90, suggesting that nonclinician abstraction of GBM diagnosis was comparable with clinician abstraction. Of 693 patients with a confirmed diagnosis of GBM, 246 patients contained structured information about the types of medications received. Of them, 165 (67.1%) received a first-line therapy (1L) of temozolomide, and the median TTNT from the start of 1L was 179 days.</p><p><strong>Conclusion: </strong>We described a workflow for extracting diagnosis of GBM and LOT from EHR data that combines nonclinician abstraction with algorithmic processing, demonstrating comparable accuracy with clinician abstraction and highlighting the potential for scalable and efficient EHR-based oncology research.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"8 ","pages":"e2300091"},"PeriodicalIF":3.3,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11371099/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141302007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JCO Clinical Cancer Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1