Challenge of missing data in observational studies: investigating cross-sectional imputation methods for assessing disease activity in axial spondyloarthritis.
Stylianos Georgiadis, Marion Pons, Simon Rasmussen, Merete Lund Hetland, Louise Linde, Daniela di Giuseppe, Brigitte Michelsen, Johan K Wallman, Tor Olofsson, Jakub Zavada, Bente Glintborg, Anne G Loft, Catalin Codreanu, Daniel Melim, Diogo Almeida, Sella Aarrestad Provan, Tore K Kvien, Vappu Rantalaiho, Ritva Peltomaa, Bjorn Gudbjornsson, Olafur Palsson, Ovidiu Rotariu, Ross MacDonald, Ziga Rotar, Katja Perdan Pirkmajer, Karin Lass, Florenzo Iannone, Adrian Ciurea, Mikkel Østergaard, L M Ørnbjerg
{"title":"Challenge of missing data in observational studies: investigating cross-sectional imputation methods for assessing disease activity in axial spondyloarthritis.","authors":"Stylianos Georgiadis, Marion Pons, Simon Rasmussen, Merete Lund Hetland, Louise Linde, Daniela di Giuseppe, Brigitte Michelsen, Johan K Wallman, Tor Olofsson, Jakub Zavada, Bente Glintborg, Anne G Loft, Catalin Codreanu, Daniel Melim, Diogo Almeida, Sella Aarrestad Provan, Tore K Kvien, Vappu Rantalaiho, Ritva Peltomaa, Bjorn Gudbjornsson, Olafur Palsson, Ovidiu Rotariu, Ross MacDonald, Ziga Rotar, Katja Perdan Pirkmajer, Karin Lass, Florenzo Iannone, Adrian Ciurea, Mikkel Østergaard, L M Ørnbjerg","doi":"10.1136/rmdopen-2024-004844","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>We aimed to compare various methods for imputing disease activity in longitudinally collected observational data of patients with axial spondyloarthritis (axSpA).</p><p><strong>Methods: </strong>We conducted a simulation study on data from 8583 axSpA patients from ten European registries. Disease activity was assessed by the Axial Spondyloarthritis Disease Activity Score (ASDAS) and the corresponding low disease activity (LDA; ASDAS<2.1) state at baseline, 6 and 12 months. We focused on cross-sectional methods which impute missing values of an individual at a particular time point based on the available information from other individuals at that time point. We applied nine single and five multiple imputation methods, covering mean, regression and hot deck methods. The performance of each imputation method was evaluated via relative bias and coverage of 95% confidence intervals for the mean ASDAS and the derived proportion of patients in LDA.</p><p><strong>Results: </strong>Hot deck imputation methods outperformed mean and regression methods, particularly when assessing LDA. Multiple imputation procedures provided better coverage than the corresponding single imputation ones. However, none of the evaluated methods produced unbiased estimates with adequate coverage across all time points, with performance for missing baseline data being worse than for missing follow-up data. Predictive mean and weighted predictive mean hot deck imputation procedures consistently provided results with low bias.</p><p><strong>Conclusions: </strong>This study contributes to the available methods for imputing disease activity in observational research. Hot deck imputation using predictive mean matching exhibited the highest robustness and is thus our suggested approach.</p>","PeriodicalId":21396,"journal":{"name":"RMD Open","volume":"11 1","pages":""},"PeriodicalIF":5.1000,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11843021/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"RMD Open","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1136/rmdopen-2024-004844","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RHEUMATOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives: We aimed to compare various methods for imputing disease activity in longitudinally collected observational data of patients with axial spondyloarthritis (axSpA).
Methods: We conducted a simulation study on data from 8583 axSpA patients from ten European registries. Disease activity was assessed by the Axial Spondyloarthritis Disease Activity Score (ASDAS) and the corresponding low disease activity (LDA; ASDAS<2.1) state at baseline, 6 and 12 months. We focused on cross-sectional methods which impute missing values of an individual at a particular time point based on the available information from other individuals at that time point. We applied nine single and five multiple imputation methods, covering mean, regression and hot deck methods. The performance of each imputation method was evaluated via relative bias and coverage of 95% confidence intervals for the mean ASDAS and the derived proportion of patients in LDA.
Results: Hot deck imputation methods outperformed mean and regression methods, particularly when assessing LDA. Multiple imputation procedures provided better coverage than the corresponding single imputation ones. However, none of the evaluated methods produced unbiased estimates with adequate coverage across all time points, with performance for missing baseline data being worse than for missing follow-up data. Predictive mean and weighted predictive mean hot deck imputation procedures consistently provided results with low bias.
Conclusions: This study contributes to the available methods for imputing disease activity in observational research. Hot deck imputation using predictive mean matching exhibited the highest robustness and is thus our suggested approach.
期刊介绍:
RMD Open publishes high quality peer-reviewed original research covering the full spectrum of musculoskeletal disorders, rheumatism and connective tissue diseases, including osteoporosis, spine and rehabilitation. Clinical and epidemiological research, basic and translational medicine, interesting clinical cases, and smaller studies that add to the literature are all considered.