Merete Lund Hetland, Anja Strangfeld, Gianluca Bonfanti, Dimitrios Soudis, J Jasper Deuring, Roger A Edwards
{"title":"使用托法替尼治疗类风湿性关节炎患者严重感染的机器学习预测和解释模型。","authors":"Merete Lund Hetland, Anja Strangfeld, Gianluca Bonfanti, Dimitrios Soudis, J Jasper Deuring, Roger A Edwards","doi":"10.1186/s13075-024-03376-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Patients with rheumatoid arthritis (RA) have an increased risk of developing serious infections (SIs) vs. individuals without RA; efforts to predict SIs in this patient group are ongoing. We assessed the ability of different machine learning modeling approaches to predict SIs using baseline data from the tofacitinib RA clinical trials program.</p><p><strong>Methods: </strong>This analysis included data from 19 clinical trials (phase 2, n = 10; phase 3, n = 6; phase 3b/4, n = 3). Patients with RA receiving tofacitinib 5 or 10 mg twice daily (BID) were included in the analysis; patients receiving tofacitinib 11 mg once daily were considered as tofacitinib 5 mg BID. All available patient-level baseline variables were extracted. Statistical and machine learning methods (logistic regression, support vector machines with linear kernel, random forest, extreme gradient boosting trees, and boosted trees) were implemented to assess the association of baseline variables with SI (logistic regression only), and to predict SI using selected baseline variables using 5-fold cross-validation. Missing values were handled individually per prediction model.</p><p><strong>Results: </strong>A total of 8404 patients with RA treated with tofacitinib were eligible for inclusion (15,310 patient-years of total follow-up) of which 473 patients reported SIs. Amongst other baseline factors, age, previous infection, and corticosteroid use were significantly associated with SI. When applying prediction modeling for SI across data from all studies, the area under the receiver operating characteristic (AUROC) curve ranged from 0.656 to 0.739. AUROC values ranged from 0.599 to 0.730 in data from phase 3 and 3b/4 studies, and from 0.563 to 0.643 in data from ORAL Surveillance only.</p><p><strong>Conclusions: </strong>Baseline factors associated with SIs in the tofacitinib RA clinical trial program were similar to established SI risk factors associated with advanced treatments for RA. Furthermore, while model performance in predicting SI was similar to other published models, this did not meet the threshold for accurate prediction (AUROC > 0.85). Thus, predicting the occurrence of SIs at baseline remains challenging and may be complicated by the changing disease course of RA over time. Inclusion of other patient-associated and healthcare delivery-related factors and harmonization of the duration of studies included in the models may be required to improve prediction.</p><p><strong>Trial registration: </strong>ClinicalTrials.gov: NCT00147498; NCT00413660; NCT00550446; NCT00603512; NCT00687193; NCT01164579; NCT00976599; NCT01059864; NCT01359150; NCT02147587; NCT00960440; NCT00847613; NCT00814307; NCT00856544; NCT00853385; NCT01039688; NCT02187055; NCT02831855; NCT02092467.</p>","PeriodicalId":8419,"journal":{"name":"Arthritis Research & Therapy","volume":"26 1","pages":"153"},"PeriodicalIF":4.9000,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11348567/pdf/","citationCount":"0","resultStr":"{\"title\":\"Machine learning prediction and explanatory models of serious infections in patients with rheumatoid arthritis treated with tofacitinib.\",\"authors\":\"Merete Lund Hetland, Anja Strangfeld, Gianluca Bonfanti, Dimitrios Soudis, J Jasper Deuring, Roger A Edwards\",\"doi\":\"10.1186/s13075-024-03376-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Patients with rheumatoid arthritis (RA) have an increased risk of developing serious infections (SIs) vs. individuals without RA; efforts to predict SIs in this patient group are ongoing. We assessed the ability of different machine learning modeling approaches to predict SIs using baseline data from the tofacitinib RA clinical trials program.</p><p><strong>Methods: </strong>This analysis included data from 19 clinical trials (phase 2, n = 10; phase 3, n = 6; phase 3b/4, n = 3). Patients with RA receiving tofacitinib 5 or 10 mg twice daily (BID) were included in the analysis; patients receiving tofacitinib 11 mg once daily were considered as tofacitinib 5 mg BID. All available patient-level baseline variables were extracted. Statistical and machine learning methods (logistic regression, support vector machines with linear kernel, random forest, extreme gradient boosting trees, and boosted trees) were implemented to assess the association of baseline variables with SI (logistic regression only), and to predict SI using selected baseline variables using 5-fold cross-validation. Missing values were handled individually per prediction model.</p><p><strong>Results: </strong>A total of 8404 patients with RA treated with tofacitinib were eligible for inclusion (15,310 patient-years of total follow-up) of which 473 patients reported SIs. Amongst other baseline factors, age, previous infection, and corticosteroid use were significantly associated with SI. When applying prediction modeling for SI across data from all studies, the area under the receiver operating characteristic (AUROC) curve ranged from 0.656 to 0.739. AUROC values ranged from 0.599 to 0.730 in data from phase 3 and 3b/4 studies, and from 0.563 to 0.643 in data from ORAL Surveillance only.</p><p><strong>Conclusions: </strong>Baseline factors associated with SIs in the tofacitinib RA clinical trial program were similar to established SI risk factors associated with advanced treatments for RA. Furthermore, while model performance in predicting SI was similar to other published models, this did not meet the threshold for accurate prediction (AUROC > 0.85). Thus, predicting the occurrence of SIs at baseline remains challenging and may be complicated by the changing disease course of RA over time. Inclusion of other patient-associated and healthcare delivery-related factors and harmonization of the duration of studies included in the models may be required to improve prediction.</p><p><strong>Trial registration: </strong>ClinicalTrials.gov: NCT00147498; NCT00413660; NCT00550446; NCT00603512; NCT00687193; NCT01164579; NCT00976599; NCT01059864; NCT01359150; NCT02147587; NCT00960440; NCT00847613; NCT00814307; NCT00856544; NCT00853385; NCT01039688; NCT02187055; NCT02831855; NCT02092467.</p>\",\"PeriodicalId\":8419,\"journal\":{\"name\":\"Arthritis Research & Therapy\",\"volume\":\"26 1\",\"pages\":\"153\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2024-08-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11348567/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Arthritis Research & Therapy\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1186/s13075-024-03376-9\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Arthritis Research & Therapy","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s13075-024-03376-9","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
摘要
背景:类风湿性关节炎(RA)患者发生严重感染(SIs)的风险比没有RA的患者高;预测该患者群体SIs的工作正在进行中。我们利用托法替尼 RA 临床试验项目的基线数据评估了不同机器学习建模方法预测 SI 的能力:该分析包括来自 19 项临床试验(2 期,n = 10;3 期,n = 6;3b/4 期,n = 3)的数据。接受托法替尼5或10毫克、每日两次(BID)治疗的RA患者被纳入分析;接受托法替尼11毫克、每日一次治疗的患者被视为托法替尼5毫克、每日两次。提取了所有可用的患者水平基线变量。采用统计和机器学习方法(逻辑回归、线性核支持向量机、随机森林、极梯度提升树和提升树)评估基线变量与 SI 的关联(仅逻辑回归),并使用选定的基线变量通过 5 倍交叉验证预测 SI。每个预测模型单独处理缺失值:共有8404名接受托法替尼治疗的RA患者符合纳入条件(总随访时间为15310患者年),其中473名患者报告了SI。在其他基线因素中,年龄、既往感染和皮质类固醇的使用与SI显著相关。在对所有研究数据进行 SI 预测建模时,接收者操作特征曲线下面积 (AUROC) 为 0.656 至 0.739。3期和3b/4期研究数据的AUROC值介于0.599至0.730之间,仅ORAL监测数据的AUROC值介于0.563至0.643之间:结论:托法替尼RA临床试验项目中与SI相关的基线因素与已确定的与RA晚期治疗相关的SI风险因素相似。此外,虽然预测SI的模型性能与其他已发表的模型相似,但并未达到准确预测的阈值(AUROC > 0.85)。因此,预测基线SI的发生仍然具有挑战性,而且随着时间的推移,RA的病程变化可能会使预测变得更加复杂。可能需要纳入其他患者相关因素和医疗服务相关因素,并统一模型中的研究持续时间,以提高预测效果:试验注册:ClinicalTrials.gov:NCT00147498;NCT00413660;NCT00550446;NCT00603512;NCT00687193;NCT01164579;NCT00976599;NCT01059864;NCT01359150;NCT02147587;NCT00960440;NCT00847613;NCT00814307;NCT00856544;NCT00853385;NCT01039688;NCT02187055;NCT02831855;NCT02092467。
Machine learning prediction and explanatory models of serious infections in patients with rheumatoid arthritis treated with tofacitinib.
Background: Patients with rheumatoid arthritis (RA) have an increased risk of developing serious infections (SIs) vs. individuals without RA; efforts to predict SIs in this patient group are ongoing. We assessed the ability of different machine learning modeling approaches to predict SIs using baseline data from the tofacitinib RA clinical trials program.
Methods: This analysis included data from 19 clinical trials (phase 2, n = 10; phase 3, n = 6; phase 3b/4, n = 3). Patients with RA receiving tofacitinib 5 or 10 mg twice daily (BID) were included in the analysis; patients receiving tofacitinib 11 mg once daily were considered as tofacitinib 5 mg BID. All available patient-level baseline variables were extracted. Statistical and machine learning methods (logistic regression, support vector machines with linear kernel, random forest, extreme gradient boosting trees, and boosted trees) were implemented to assess the association of baseline variables with SI (logistic regression only), and to predict SI using selected baseline variables using 5-fold cross-validation. Missing values were handled individually per prediction model.
Results: A total of 8404 patients with RA treated with tofacitinib were eligible for inclusion (15,310 patient-years of total follow-up) of which 473 patients reported SIs. Amongst other baseline factors, age, previous infection, and corticosteroid use were significantly associated with SI. When applying prediction modeling for SI across data from all studies, the area under the receiver operating characteristic (AUROC) curve ranged from 0.656 to 0.739. AUROC values ranged from 0.599 to 0.730 in data from phase 3 and 3b/4 studies, and from 0.563 to 0.643 in data from ORAL Surveillance only.
Conclusions: Baseline factors associated with SIs in the tofacitinib RA clinical trial program were similar to established SI risk factors associated with advanced treatments for RA. Furthermore, while model performance in predicting SI was similar to other published models, this did not meet the threshold for accurate prediction (AUROC > 0.85). Thus, predicting the occurrence of SIs at baseline remains challenging and may be complicated by the changing disease course of RA over time. Inclusion of other patient-associated and healthcare delivery-related factors and harmonization of the duration of studies included in the models may be required to improve prediction.
期刊介绍:
Established in 1999, Arthritis Research and Therapy is an international, open access, peer-reviewed journal, publishing original articles in the area of musculoskeletal research and therapy as well as, reviews, commentaries and reports. A major focus of the journal is on the immunologic processes leading to inflammation, damage and repair as they relate to autoimmune rheumatic and musculoskeletal conditions, and which inform the translation of this knowledge into advances in clinical care. Original basic, translational and clinical research is considered for publication along with results of early and late phase therapeutic trials, especially as they pertain to the underpinning science that informs clinical observations in interventional studies.