What's in a score: A longitudinal investigation of scores based on item response theory and classical test theory for the Amsterdam Instrumental Activities of Daily Living Questionnaire in cognitively normal and impaired older adults.
Mark A Dubbelman, Merel C Postema, Roos J Jutten, John E Harrison, Craig W Ritchie, André Aleman, Frank Jan de Jong, Benjamin D Schalet, Caroline B Terwee, Wiesje M van der Flier, Philip Scheltens, Sietske A M Sikkes
{"title":"What's in a score: A longitudinal investigation of scores based on item response theory and classical test theory for the Amsterdam Instrumental Activities of Daily Living Questionnaire in cognitively normal and impaired older adults.","authors":"Mark A Dubbelman, Merel C Postema, Roos J Jutten, John E Harrison, Craig W Ritchie, André Aleman, Frank Jan de Jong, Benjamin D Schalet, Caroline B Terwee, Wiesje M van der Flier, Philip Scheltens, Sietske A M Sikkes","doi":"10.1037/neu0000914","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>We aimed to investigate whether item response theory (IRT)-based scoring allows for a more accurate, responsive, and less biased assessment of everyday functioning than traditional classical test theory (CTT)-based scoring, as measured with the Amsterdam Instrumental Activities of Daily Living Questionnaire.</p><p><strong>Method: </strong>In this longitudinal multicenter study including cognitively normal and impaired individuals, we examined IRT-based and CTT-based score distributions and differences between diagnostic groups using linear regressions, and investigated scale attenuation. We compared change over time between scoring methods using linear mixed models with random intercepts and slopes for time.</p><p><strong>Results: </strong>Two thousand two hundred ninety-four participants were included (66.6 ± 7.7 years, 54% female): <i>n</i> = 2,032 (89%) with normal cognition, <i>n</i> = 93 (4%) with subjective cognitive decline, <i>n</i> = 79 (3%) with mild cognitive impairment, and <i>n</i> = 91 (4%) with dementia. At baseline, IRT-based and CTT-based scores were highly correlated (<i>r</i> = -0.92). IRT-based scores showed less scale attenuation than CTT-based scores. In a subsample of <i>n</i> = 1,145 (62%) who were followed for a mean of 1.3 (<i>SD</i> = 0.6) years, IRT-based scores declined significantly among cognitively normal individuals (unstandardized coefficient [<i>B</i>] = -0.15, 95% confidence interval, 95% CI [-0.28, -0.03], effect size = -0.02), whereas CTT-based scores did not (<i>B</i> = 0.20, 95% CI [-0.02, 0.41], effect size = 0.02). In the other diagnostic groups, effect sizes of change over time were similar.</p><p><strong>Conclusions: </strong>IRT-based scores were less affected by scale attenuation than CTT-based scores. With regard to responsiveness, IRT-based scores showed more signal than CTT-based scores in early disease stages, highlighting the IRT-based scores' superior suitability for use in preclinical populations. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</p>","PeriodicalId":19205,"journal":{"name":"Neuropsychology","volume":null,"pages":null},"PeriodicalIF":2.6000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neuropsychology","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1037/neu0000914","RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/9/7 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: We aimed to investigate whether item response theory (IRT)-based scoring allows for a more accurate, responsive, and less biased assessment of everyday functioning than traditional classical test theory (CTT)-based scoring, as measured with the Amsterdam Instrumental Activities of Daily Living Questionnaire.
Method: In this longitudinal multicenter study including cognitively normal and impaired individuals, we examined IRT-based and CTT-based score distributions and differences between diagnostic groups using linear regressions, and investigated scale attenuation. We compared change over time between scoring methods using linear mixed models with random intercepts and slopes for time.
Results: Two thousand two hundred ninety-four participants were included (66.6 ± 7.7 years, 54% female): n = 2,032 (89%) with normal cognition, n = 93 (4%) with subjective cognitive decline, n = 79 (3%) with mild cognitive impairment, and n = 91 (4%) with dementia. At baseline, IRT-based and CTT-based scores were highly correlated (r = -0.92). IRT-based scores showed less scale attenuation than CTT-based scores. In a subsample of n = 1,145 (62%) who were followed for a mean of 1.3 (SD = 0.6) years, IRT-based scores declined significantly among cognitively normal individuals (unstandardized coefficient [B] = -0.15, 95% confidence interval, 95% CI [-0.28, -0.03], effect size = -0.02), whereas CTT-based scores did not (B = 0.20, 95% CI [-0.02, 0.41], effect size = 0.02). In the other diagnostic groups, effect sizes of change over time were similar.
Conclusions: IRT-based scores were less affected by scale attenuation than CTT-based scores. With regard to responsiveness, IRT-based scores showed more signal than CTT-based scores in early disease stages, highlighting the IRT-based scores' superior suitability for use in preclinical populations. (PsycInfo Database Record (c) 2023 APA, all rights reserved).
期刊介绍:
Neuropsychology publishes original, empirical research; systematic reviews and meta-analyses; and theoretical articles on the relation between brain and human cognitive, emotional, and behavioral function.