Handling of missing component information for common composite score outcomes used in axial spondyloarthritis research when complete-case analysis is unbiased.
Christos Polysopoulos, Stylianos Georgiadis, Lykke Midtbøll Ørnbjerg, Almut Scherer, Daniela Di Giuseppe, Merete Lund Hetland, Michael John Nissen, Gareth T Jones, Bente Glintborg, Anne Gitte Loft, Johan Karlsson Wallman, Karel Pavelka, Jakub Závada, Ayten Yazici, Maria José Santos, Adrian Ciurea, Burkhard Möller, Brigitte Michelsen, Pawel Mielnik, Johanna Huhtakangas, Heikki Relas, Katja Perdan Pirkmajer, Ziga Rotar, Ross MacDonald, Bjorn Gudbjornsson, Irene van der Horst-Bruinsma, Marleen van de Sande, Myriam Riek
{"title":"Handling of missing component information for common composite score outcomes used in axial spondyloarthritis research when complete-case analysis is unbiased.","authors":"Christos Polysopoulos, Stylianos Georgiadis, Lykke Midtbøll Ørnbjerg, Almut Scherer, Daniela Di Giuseppe, Merete Lund Hetland, Michael John Nissen, Gareth T Jones, Bente Glintborg, Anne Gitte Loft, Johan Karlsson Wallman, Karel Pavelka, Jakub Závada, Ayten Yazici, Maria José Santos, Adrian Ciurea, Burkhard Möller, Brigitte Michelsen, Pawel Mielnik, Johanna Huhtakangas, Heikki Relas, Katja Perdan Pirkmajer, Ziga Rotar, Ross MacDonald, Bjorn Gudbjornsson, Irene van der Horst-Bruinsma, Marleen van de Sande, Myriam Riek","doi":"10.1186/s12874-025-02515-3","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Observational data on composite scores often comes with missing component information. When a complete-case (CC) analysis of composite scores is unbiased, preferable approaches of dealing with missing component information should also be unbiased and provide a more precise estimate. We assessed the performance of several methods compared to CC analysis in estimating the means of common composite scores used in axial spondyloarthritis research.</p><p><strong>Methods: </strong>Individual mean imputation (IMI), the modified formula method (MF), overall mean imputation (OMI), and multiple imputation of missing component values (MI) were assessed either analytically or by means of simulations from available data collected across Europe. Their performance in estimating the means of the Bath Ankylosing Spondylitis Disease Activity Index (BASDAI), the Bath Ankylosing Spondylitis Functional Index (BASFI), and the Ankylosing Spondylitis Disease Activity Score based on C-reactive protein (ASDAS-CRP) in cases where component information was set missing completely at random was compared to the CC approach based on bias, variance, and coverage.</p><p><strong>Results: </strong>Like the MF method, IMI uses a modified formula for observations with missing components resulting in modified composite scores. In the case of an unbiased CC approach, these two methods yielded representative samples of the distribution arising from a mixture of the original and modified composite scores, which, however, could not be considered the same as the distribution of the original score. The IMI and MF method are, thus, intrinsically biased. OMI provided an unbiased mean but displayed a complex dependence structure among observations that, if not accounted for, resulted in severe coverage issues. MI improved precision compared to CC and gave unbiased means and proper coverage as long as the extent of missingness was not too large.</p><p><strong>Conclusions: </strong>MI of missing component values was the only method found successful in retaining CC's unbiasedness and in providing increased precision for estimating the means of BASDAI, BASFI, and ASDAS-CRP. However, since MI is susceptible to incorrect implementation and its performance may become questionable with increasing missingness, we consider the implementation of an error-free CC approach a valid and valuable option.</p><p><strong>Trial registration: </strong>Not applicable as study uses data from patient registries.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"25 1","pages":"55"},"PeriodicalIF":3.9000,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11869558/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Research Methodology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12874-025-02515-3","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Observational data on composite scores often comes with missing component information. When a complete-case (CC) analysis of composite scores is unbiased, preferable approaches of dealing with missing component information should also be unbiased and provide a more precise estimate. We assessed the performance of several methods compared to CC analysis in estimating the means of common composite scores used in axial spondyloarthritis research.
Methods: Individual mean imputation (IMI), the modified formula method (MF), overall mean imputation (OMI), and multiple imputation of missing component values (MI) were assessed either analytically or by means of simulations from available data collected across Europe. Their performance in estimating the means of the Bath Ankylosing Spondylitis Disease Activity Index (BASDAI), the Bath Ankylosing Spondylitis Functional Index (BASFI), and the Ankylosing Spondylitis Disease Activity Score based on C-reactive protein (ASDAS-CRP) in cases where component information was set missing completely at random was compared to the CC approach based on bias, variance, and coverage.
Results: Like the MF method, IMI uses a modified formula for observations with missing components resulting in modified composite scores. In the case of an unbiased CC approach, these two methods yielded representative samples of the distribution arising from a mixture of the original and modified composite scores, which, however, could not be considered the same as the distribution of the original score. The IMI and MF method are, thus, intrinsically biased. OMI provided an unbiased mean but displayed a complex dependence structure among observations that, if not accounted for, resulted in severe coverage issues. MI improved precision compared to CC and gave unbiased means and proper coverage as long as the extent of missingness was not too large.
Conclusions: MI of missing component values was the only method found successful in retaining CC's unbiasedness and in providing increased precision for estimating the means of BASDAI, BASFI, and ASDAS-CRP. However, since MI is susceptible to incorrect implementation and its performance may become questionable with increasing missingness, we consider the implementation of an error-free CC approach a valid and valuable option.
Trial registration: Not applicable as study uses data from patient registries.
期刊介绍:
BMC Medical Research Methodology is an open access journal publishing original peer-reviewed research articles in methodological approaches to healthcare research. Articles on the methodology of epidemiological research, clinical trials and meta-analysis/systematic review are particularly encouraged, as are empirical studies of the associations between choice of methodology and study outcomes. BMC Medical Research Methodology does not aim to publish articles describing scientific methods or techniques: these should be directed to the BMC journal covering the relevant biomedical subject area.