An interpretable machine learning approach for detecting psoriatic arthritis in a UK primary care psoriasis cohort using electronic health records from the Clinical Practice Research Datalink.

IF 20.6 1区 医学 Q1 RHEUMATOLOGY Annals of the Rheumatic Diseases Pub Date : 2025-04-01 Epub Date: 2025-03-01 DOI:10.1016/j.ard.2025.01.051
Alexander Rudge, Neil McHugh, William Tillett, Theresa Smith
{"title":"An interpretable machine learning approach for detecting psoriatic arthritis in a UK primary care psoriasis cohort using electronic health records from the Clinical Practice Research Datalink.","authors":"Alexander Rudge, Neil McHugh, William Tillett, Theresa Smith","doi":"10.1016/j.ard.2025.01.051","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Develop an interpretable machine learning model to detect patients with newly diagnosed psoriatic arthritis (PsA) in a cohort of psoriasis patients and identify important clinical indicators of PsA in primary care.</p><p><strong>Methods: </strong>We developed models using UK primary care electronic health records from the Clinical Practice Research Datalink (CPRD). The study population consisted of a cohort of (PsA free) patients with incident psoriasis who were followed prospectively. We used Bayesian networks (BNs) to identify patients who developed PsA using primary care variables measured prior to diagnosis and compared the results to a random forest (RF). Variables included patient demographics, musculoskeletal symptoms, blood tests, and prescriptions. The importance of each variable used in the models was evaluated using permutation variable importance. Model discrimination was measured using the area under the receiver operating characteristic curve (AUC) and the area under the precision-recall curve (PRAUC).</p><p><strong>Results: </strong>We identified a cohort of 122,330 patients with an incident psoriasis diagnosis between 1998 and 2019 in the CPRD, of whom 2460 patients went on to develop PsA. Our best BN achieved an AUC of 0.823, and PRAUC of 0.221, compared to the AUC of 0.851 and PRAUC of 0.261 of the RF. Psoriasis duration, nonsteroidal anti-inflammatory drug prescriptions, nonspecific arthritis, nonspecific arthralgia, and C-reactive protein blood tests were all important variables in our models.</p><p><strong>Conclusions: </strong>We were able to identify psoriasis patients at higher risk, and important indicators, of PsA in UK primary care. Further work is required to evaluate our model's usefulness in assisting PsA screening.</p>","PeriodicalId":8087,"journal":{"name":"Annals of the Rheumatic Diseases","volume":" ","pages":"575-583"},"PeriodicalIF":20.6000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of the Rheumatic Diseases","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.ard.2025.01.051","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/1 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"RHEUMATOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Objectives: Develop an interpretable machine learning model to detect patients with newly diagnosed psoriatic arthritis (PsA) in a cohort of psoriasis patients and identify important clinical indicators of PsA in primary care.

Methods: We developed models using UK primary care electronic health records from the Clinical Practice Research Datalink (CPRD). The study population consisted of a cohort of (PsA free) patients with incident psoriasis who were followed prospectively. We used Bayesian networks (BNs) to identify patients who developed PsA using primary care variables measured prior to diagnosis and compared the results to a random forest (RF). Variables included patient demographics, musculoskeletal symptoms, blood tests, and prescriptions. The importance of each variable used in the models was evaluated using permutation variable importance. Model discrimination was measured using the area under the receiver operating characteristic curve (AUC) and the area under the precision-recall curve (PRAUC).

Results: We identified a cohort of 122,330 patients with an incident psoriasis diagnosis between 1998 and 2019 in the CPRD, of whom 2460 patients went on to develop PsA. Our best BN achieved an AUC of 0.823, and PRAUC of 0.221, compared to the AUC of 0.851 and PRAUC of 0.261 of the RF. Psoriasis duration, nonsteroidal anti-inflammatory drug prescriptions, nonspecific arthritis, nonspecific arthralgia, and C-reactive protein blood tests were all important variables in our models.

Conclusions: We were able to identify psoriasis patients at higher risk, and important indicators, of PsA in UK primary care. Further work is required to evaluate our model's usefulness in assisting PsA screening.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种可解释的机器学习方法,用于使用来自临床实践研究数据链的电子健康记录检测英国初级保健银屑病队列中的银屑病关节炎。
目标:开发一种可解释的机器学习模型,以检测牛皮癣患者队列中新诊断出的银屑病关节炎(PsA)患者,并确定初级治疗中的重要临床指标:开发一种可解释的机器学习模型,以检测银屑病患者队列中新诊断出的银屑病关节炎(PsA)患者,并确定初级保健中 PsA 的重要临床指标:我们利用临床实践研究数据链接(CPRD)中的英国初级保健电子健康记录开发了模型。研究对象包括一组前瞻性跟踪的(无 PsA)银屑病患者。我们使用贝叶斯网络(BN),利用诊断前测量的初级保健变量来识别罹患 PsA 的患者,并将结果与随机森林(RF)进行比较。变量包括患者人口统计学特征、肌肉骨骼症状、血液化验和处方。模型中使用的每个变量的重要性都是通过置换变量重要性来评估的。使用接收者操作特征曲线下面积(AUC)和精确度-召回曲线下面积(PRAUC)测量模型的区分度:我们在 CPRD 中确定了 1998 年至 2019 年期间 122,330 例银屑病诊断患者,其中 2460 例患者后来发展为 PsA。我们的最佳BN的AUC为0.823,PRAUC为0.221,而RF的AUC为0.851,PRAUC为0.261。牛皮癣病程、非甾体抗炎药处方、非特异性关节炎、非特异性关节痛和C反应蛋白血检都是我们模型中的重要变量:结论:我们能够识别英国初级医疗机构中PsA风险较高的银屑病患者和重要指标。还需要进一步的工作来评估我们的模型在协助 PsA 筛查方面的实用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Annals of the Rheumatic Diseases
Annals of the Rheumatic Diseases 医学-风湿病学
CiteScore
35.00
自引率
9.90%
发文量
3728
审稿时长
1.4 months
期刊介绍: Annals of the Rheumatic Diseases (ARD) is an international peer-reviewed journal covering all aspects of rheumatology, which includes the full spectrum of musculoskeletal conditions, arthritic disease, and connective tissue disorders. ARD publishes basic, clinical, and translational scientific research, including the most important recommendations for the management of various conditions.
期刊最新文献
The evolving landscape of complement therapeutics in rheumatology. Musculoskeletal ultrasound findings in people with asymptomatic hyperuricemia: baseline analysis from the Transitions in Gout Research (TIGER) study using the OMERACT gout ultrasound semiquantitative scoring system. Altered mitochondrial ultrastructure in salivary epithelial cells of patients with Sjögren's disease is associated with mitochondrial DNA release and increased activation of pattern recognition receptors: potential use of tofacitinib as therapy. Human hypofunctional NCF1 variant aggravates salivary gland immunopathology in Sjögren's disease by promoting switched memory B-cell recruitment and long-lived plasma cell differentiation. Transmembrane TNF-α signalling of macrophages drives pathological osteogenesis in radiographic axial spondyloarthritis.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1