Quantifying and improving rheumatoid arthritis algorithm performance in biobank settings

Seminars in Arthritis and Rheumatism (IF 4.6; Zone 2, Medicine; JCR Q1, Rheumatology). Pub Date: 2025-02-22. DOI: 10.1016/j.semarthrit.2025.152668
Vanessa L. Kronzer, Katrina A. Williamson, Andrew C. Hanson, Jennifer A. Sletten, Jeffrey A. Sparks, John M. Davis III, Cynthia S. Crowson
Volume 72, Article 152668. Full text: https://www.sciencedirect.com/science/article/pii/S0049017225000393

Abstract

Objective

To quantify and improve the performance of standard rheumatoid arthritis (RA) algorithms in a biobank setting.

Methods

This retrospective cohort study within the Mayo Clinic (MC) Biobank and MC Tapestry Study identified RA cases by the presence of at least two RA codes or positive anti-cyclic citrullinated peptide (anti-CCP) antibodies plus disease-modifying anti-rheumatic drug (DMARD) prescription as of 7/18/2022. Rheumatology physicians manually verified all RA cases using RA criteria and/or rheumatology physician diagnosis plus DMARD use. All other biobank participants served as non-RA controls. We defined seropositivity as rheumatoid factor and/or anti-CCP positivity. We assessed rules-based and Electronic Medical Records and Genomics (eMERGE) RA algorithms using positive predictive value (PPV). Finally, we developed a novel RA algorithm using a LASSO-based machine learning approach with five-fold cross-validation.
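
The screening rule and the PPV metric described above can be sketched in a few lines. This is a minimal illustration, not the study's code: the `Participant` fields, the function names, and the toy records are all assumptions introduced here. The sketch also reads the rule as (at least two RA codes) OR (anti-CCP positive AND DMARD prescribed), which is only one possible parsing of the Methods sentence.

```python
from dataclasses import dataclass


@dataclass
class Participant:
    # Hypothetical record layout; field names are illustrative only.
    ra_code_count: int        # number of RA diagnosis codes on record
    anti_ccp_positive: bool   # anti-CCP antibody test result
    dmard_prescribed: bool    # any DMARD prescription on record
    confirmed_ra: bool        # outcome of manual physician review


def screens_positive(p: Participant) -> bool:
    """Assumed parsing: >=2 RA codes, OR (anti-CCP positive AND DMARD)."""
    return p.ra_code_count >= 2 or (p.anti_ccp_positive and p.dmard_prescribed)


def ppv(flags, truths):
    """Positive predictive value: confirmed cases among flagged participants."""
    flagged = [truth for flag, truth in zip(flags, truths) if flag]
    if not flagged:
        raise ValueError("no participants were flagged")
    return sum(flagged) / len(flagged)


# Toy cohort (fabricated for illustration, not study data).
cohort = [
    Participant(3, False, True, True),
    Participant(2, False, False, False),
    Participant(0, True, True, True),
    Participant(1, True, False, False),
    Participant(0, False, False, False),
]
flags = [screens_positive(p) for p in cohort]
truths = [p.confirmed_ra for p in cohort]
toy_ppv = ppv(flags, truths)
print(f"Flagged {sum(flags)} of {len(cohort)}; PPV = {toy_ppv:.2f}")
```

In this toy cohort three participants are flagged and two of them are confirmed on review, so the PPV is 2/3. The same `ppv` helper could be applied to any stricter rule (for example, codes plus DMARD plus seropositivity) by swapping the flagging function.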

Results

We identified 1,316 confirmed RA cases (968 MC Biobank, 348 Tapestry, 70% seropositive) and 82,123 non-RA controls (mean age 65, 61% female). The PPV of 3 RA codes was 43%, codes plus DMARD was 54%, and codes plus DMARD plus seropositivity was 85%. The PPV of eMERGE was 77%. Self-reported RA, which was available in the MC Biobank (PPV 10%), only minimally improved algorithm performance (PPV from 83% to 85%), whereas family history of RA (PPV 3%) worsened performance. At 90% PPV, the novel RA algorithm incorporating key variables such as anti-CCP and DMARD use increased sensitivity by 4–11% compared to eMERGE.
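
Comparing algorithms "at 90% PPV" means fixing the PPV and reporting the sensitivity each algorithm reaches at that operating point. The abstract does not say how the operating point was chosen, so the sketch below shows one common approach under stated assumptions: sweep the score cutoff from high to low and keep the lowest cutoff whose PPV still meets the target. The function name `sensitivity_at_ppv` and the toy scores are hypothetical.

```python
def sensitivity_at_ppv(scores, labels, target_ppv=0.90):
    """Return (cutoff, sensitivity) for the lowest cutoff with PPV >= target_ppv.

    Participants with score >= cutoff are flagged. Returns None when no
    cutoff reaches the target PPV.
    """
    total_pos = sum(labels)
    if total_pos == 0:
        raise ValueError("no confirmed cases in labels")
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    best = None
    true_pos = 0
    i = 0
    while i < len(ranked):
        cutoff = ranked[i][0]
        # Take every participant tied at this score in one step.
        while i < len(ranked) and ranked[i][0] == cutoff:
            true_pos += ranked[i][1]
            i += 1
        flagged = i  # everyone with score >= cutoff
        if true_pos / flagged >= target_ppv:
            best = (cutoff, true_pos / total_pos)
    return best


# Toy scores and labels (fabricated for illustration, not study data).
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4]
labels = [1, 1, 1, 0, 1, 0]
print(sensitivity_at_ppv(scores, labels))        # target PPV 0.90
print(sensitivity_at_ppv(scores, labels, 0.75))  # looser target PPV
```

Running the same function on two algorithms' scores with the same `target_ppv` gives directly comparable sensitivities, which is the kind of comparison the Results sentence reports.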

Conclusion

Rules-based and eMERGE RA algorithms performed worse in biobank settings than in administrative settings. Our novel RA algorithm outperformed these standard algorithms.