Alexandra König, Stefanie Köhler, Johannes Tröger, Emrah Düzel, Wenzel Glanz, Michaela Butryn, Elisa Mallick, Josef Priller, Slawek Altenstein, Annika Spottke, Okka Kimmich, Björn Falkenburger, Antje Osterrath, Jens Wiltfang, Claudia Bartels, Ingo Kilimann, Christoph Laske, Matthias H Munk, Sandra Roeske, Ingo Frommann, Daniel C Hoffmann, Frank Jessen, Michael Wagner, Nicklas Linz, Stefan Teipel
{"title":"Automated remote speech-based testing of individuals with cognitive decline: Bayesian agreement of transcription accuracy.","authors":"Alexandra König, Stefanie Köhler, Johannes Tröger, Emrah Düzel, Wenzel Glanz, Michaela Butryn, Elisa Mallick, Josef Priller, Slawek Altenstein, Annika Spottke, Okka Kimmich, Björn Falkenburger, Antje Osterrath, Jens Wiltfang, Claudia Bartels, Ingo Kilimann, Christoph Laske, Matthias H Munk, Sandra Roeske, Ingo Frommann, Daniel C Hoffmann, Frank Jessen, Michael Wagner, Nicklas Linz, Stefan Teipel","doi":"10.1002/dad2.70011","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>We investigated the agreement between automated and gold-standard manual transcriptions of telephone chatbot-based semantic verbal fluency testing.</p><p><strong>Methods: </strong>We examined 78 cases from the Screening over Speech in Unselected Populations for Clinical Trials in AD (PROSPECT-AD) study, including cognitively normal individuals and individuals with subjective cognitive decline, mild cognitive impairment, and dementia. We used Bayesian Bland-Altman analysis of word count and the qualitative features of semantic cluster size, cluster switches, and word frequencies.</p><p><strong>Results: </strong>We found high levels of agreement for word count, with a 93% probability of a newly observed difference being below the minimally important difference. The qualitative features had fair levels of agreement. Word count reached high levels of discrimination between cognitively impaired and unimpaired individuals, regardless of transcription mode.</p><p><strong>Discussion: </strong>Our results support the use of automated speech recognition particularly for the assessment of quantitative speech features, even when using data from telephone calls with cognitively impaired individuals in their homes.</p><p><strong>Highlights: </strong>High levels of agreement were found between automated and gold-standard manual transcriptions of telephone chatbot-based semantic verbal fluency testing, particularly for word count.The qualitative features had fair levels of agreement.Word count reached high levels of discrimination between cognitively impaired and unimpaired individuals, regardless of transcription mode.Automated speech recognition for the assessment of quantitative and qualitative speech features, even when using data from telephone calls with cognitively impaired individuals in their homes, seems feasible and reliable.</p>","PeriodicalId":53226,"journal":{"name":"Alzheimer''s and Dementia: Diagnosis, Assessment and Disease Monitoring","volume":"16 4","pages":"e70011"},"PeriodicalIF":4.0000,"publicationDate":"2024-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11456616/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Alzheimer''s and Dementia: Diagnosis, Assessment and Disease Monitoring","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/dad2.70011","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/10/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: We investigated the agreement between automated and gold-standard manual transcriptions of telephone chatbot-based semantic verbal fluency testing.
Methods: We examined 78 cases from the Screening over Speech in Unselected Populations for Clinical Trials in AD (PROSPECT-AD) study, including cognitively normal individuals and individuals with subjective cognitive decline, mild cognitive impairment, and dementia. We used Bayesian Bland-Altman analysis of word count and the qualitative features of semantic cluster size, cluster switches, and word frequencies.
Results: We found high levels of agreement for word count, with a 93% probability of a newly observed difference being below the minimally important difference. The qualitative features had fair levels of agreement. Word count reached high levels of discrimination between cognitively impaired and unimpaired individuals, regardless of transcription mode.
Discussion: Our results support the use of automated speech recognition particularly for the assessment of quantitative speech features, even when using data from telephone calls with cognitively impaired individuals in their homes.
Highlights: High levels of agreement were found between automated and gold-standard manual transcriptions of telephone chatbot-based semantic verbal fluency testing, particularly for word count.The qualitative features had fair levels of agreement.Word count reached high levels of discrimination between cognitively impaired and unimpaired individuals, regardless of transcription mode.Automated speech recognition for the assessment of quantitative and qualitative speech features, even when using data from telephone calls with cognitively impaired individuals in their homes, seems feasible and reliable.
期刊介绍:
Alzheimer''s & Dementia: Diagnosis, Assessment & Disease Monitoring (DADM) is an open access, peer-reviewed, journal from the Alzheimer''s Association® that will publish new research that reports the discovery, development and validation of instruments, technologies, algorithms, and innovative processes. Papers will cover a range of topics interested in the early and accurate detection of individuals with memory complaints and/or among asymptomatic individuals at elevated risk for various forms of memory disorders. The expectation for published papers will be to translate fundamental knowledge about the neurobiology of the disease into practical reports that describe both the conceptual and methodological aspects of the submitted scientific inquiry. Published topics will explore the development of biomarkers, surrogate markers, and conceptual/methodological challenges. Publication priority will be given to papers that 1) describe putative surrogate markers that accurately track disease progression, 2) biomarkers that fulfill international regulatory requirements, 3) reports from large, well-characterized population-based cohorts that comprise the heterogeneity and diversity of asymptomatic individuals and 4) algorithmic development that considers multi-marker arrays (e.g., integrated-omics, genetics, biofluids, imaging, etc.) and advanced computational analytics and technologies.