Elizabeth Wragg, Caroline Skirrow, Pasquale Dente, Jack Cotter, Peter Annas, Milly Lowther, Rosa Backx, Jenny Barnett, Fiona Cree, Jasmin Kroll, Francesca Cormack
{"title":"Generating normative data from web-based administration of the Cambridge Neuropsychological Test Automated Battery using a Bayesian framework.","authors":"Elizabeth Wragg, Caroline Skirrow, Pasquale Dente, Jack Cotter, Peter Annas, Milly Lowther, Rosa Backx, Jenny Barnett, Fiona Cree, Jasmin Kroll, Francesca Cormack","doi":"10.3389/fdgth.2024.1294222","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Normative cognitive data can distinguish impairment from healthy cognitive function and pathological decline from normal ageing. Traditional methods for deriving normative data typically require extremely large samples of healthy participants, stratifying test variation by pre-specified age groups and key demographic features (age, sex, education). Linear regression approaches can provide normative data from more sparsely sampled datasets, but non-normal distributions of many cognitive test results may lead to violation of model assumptions, limiting generalisability.</p><p><strong>Method: </strong>The current study proposes a novel Bayesian framework for normative data generation. Participants (<i>n</i> = 728; 368 male and 360 female, age 18-75 years), completed the Cambridge Neuropsychological Test Automated Battery via the research crowdsourcing website Prolific.ac. Participants completed tests of visuospatial recognition memory (Spatial Working Memory test), visual episodic memory (Paired Associate Learning test) and sustained attention (Rapid Visual Information Processing test). Test outcomes were modelled as a function of age using Bayesian Generalised Linear Models, which were able to derive posterior distributions of the authentic data, drawing from a wide family of distributions. Markov Chain Monte Carlo algorithms generated a large synthetic dataset from posterior distributions for each outcome measure, capturing normative distributions of cognition as a function of age, sex and education.</p><p><strong>Results: </strong>Comparison with stratified and linear regression methods showed converging results, with the Bayesian approach producing similar age, sex and education trends in the data, and similar categorisation of individual performance levels.</p><p><strong>Conclusion: </strong>This study documents a novel, reproducible and robust method for describing normative cognitive performance with ageing using a large dataset.</p>","PeriodicalId":73078,"journal":{"name":"Frontiers in digital health","volume":null,"pages":null},"PeriodicalIF":3.2000,"publicationDate":"2024-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11451437/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in digital health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fdgth.2024.1294222","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: Normative cognitive data can distinguish impairment from healthy cognitive function and pathological decline from normal ageing. Traditional methods for deriving normative data typically require extremely large samples of healthy participants, stratifying test variation by pre-specified age groups and key demographic features (age, sex, education). Linear regression approaches can provide normative data from more sparsely sampled datasets, but non-normal distributions of many cognitive test results may lead to violation of model assumptions, limiting generalisability.
Method: The current study proposes a novel Bayesian framework for normative data generation. Participants (n = 728; 368 male and 360 female, age 18-75 years), completed the Cambridge Neuropsychological Test Automated Battery via the research crowdsourcing website Prolific.ac. Participants completed tests of visuospatial recognition memory (Spatial Working Memory test), visual episodic memory (Paired Associate Learning test) and sustained attention (Rapid Visual Information Processing test). Test outcomes were modelled as a function of age using Bayesian Generalised Linear Models, which were able to derive posterior distributions of the authentic data, drawing from a wide family of distributions. Markov Chain Monte Carlo algorithms generated a large synthetic dataset from posterior distributions for each outcome measure, capturing normative distributions of cognition as a function of age, sex and education.
Results: Comparison with stratified and linear regression methods showed converging results, with the Bayesian approach producing similar age, sex and education trends in the data, and similar categorisation of individual performance levels.
Conclusion: This study documents a novel, reproducible and robust method for describing normative cognitive performance with ageing using a large dataset.