Validation of an Automated Speech Analysis of Cognitive Tasks within a Semiautomated Phone Assessment.

Q1 Computer Science Digital Biomarkers Pub Date : 2023-08-31 eCollection Date: 2023-01-01 DOI:10.1159/000533188

Daphne Ter Huurne, Nina Possemis, Leonie Banning, Angélique Gruters, Alexandra König, Nicklas Linz, Johannes Tröger, Kai Langel, Frans Verhey, Marjolein de Vugt, Inez Ramakers

{"title":"Validation of an Automated Speech Analysis of Cognitive Tasks within a Semiautomated Phone Assessment.","authors":"Daphne Ter Huurne, Nina Possemis, Leonie Banning, Angélique Gruters, Alexandra König, Nicklas Linz, Johannes Tröger, Kai Langel, Frans Verhey, Marjolein de Vugt, Inez Ramakers","doi":"10.1159/000533188","DOIUrl":null,"url":null,"abstract":"Introduction: We studied the accuracy of the automatic speech recognition (ASR) software by comparing ASR scores with manual scores from a verbal learning test (VLT) and a semantic verbal fluency (SVF) task in a semiautomated phone assessment in a memory clinic population. Furthermore, we examined the differentiating value of these tests between participants with subjective cognitive decline (SCD) and mild cognitive impairment (MCI). We also investigated whether the automatically calculated speech and linguistic features had an additional value compared to the commonly used total scores in a semiautomated phone assessment.Methods: We included 94 participants from the memory clinic of the Maastricht University Medical Center+ (SCD N = 56 and MCI N = 38). The test leader guided the participant through a semiautomated phone assessment. The VLT and SVF were audio recorded and processed via a mobile application. The recall count and speech and linguistic features were automatically extracted. The diagnostic groups were classified by training machine learning classifiers to differentiate SCD and MCI participants.Results: The intraclass correlation for inter-rater reliability between the manual and the ASR total word count was 0.89 (95% CI 0.09-0.97) for the VLT immediate recall, 0.94 (95% CI 0.68-0.98) for the VLT delayed recall, and 0.93 (95% CI 0.56-0.97) for the SVF. The full model including the total word count and speech and linguistic features had an area under the curve of 0.81 and 0.77 for the VLT immediate and delayed recall, respectively, and 0.61 for the SVF.Conclusion: There was a high agreement between the ASR and manual scores, keeping the broad confidence intervals in mind. The phone-based VLT was able to differentiate between SCD and MCI and can have opportunities for clinical trial screening.","PeriodicalId":11242,"journal":{"name":"Digital Biomarkers","volume":"7 1","pages":"115-123"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10601928/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Biomarkers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1159/000533188","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction: We studied the accuracy of the automatic speech recognition (ASR) software by comparing ASR scores with manual scores from a verbal learning test (VLT) and a semantic verbal fluency (SVF) task in a semiautomated phone assessment in a memory clinic population. Furthermore, we examined the differentiating value of these tests between participants with subjective cognitive decline (SCD) and mild cognitive impairment (MCI). We also investigated whether the automatically calculated speech and linguistic features had an additional value compared to the commonly used total scores in a semiautomated phone assessment.

Methods: We included 94 participants from the memory clinic of the Maastricht University Medical Center+ (SCD N = 56 and MCI N = 38). The test leader guided the participant through a semiautomated phone assessment. The VLT and SVF were audio recorded and processed via a mobile application. The recall count and speech and linguistic features were automatically extracted. The diagnostic groups were classified by training machine learning classifiers to differentiate SCD and MCI participants.

Results: The intraclass correlation for inter-rater reliability between the manual and the ASR total word count was 0.89 (95% CI 0.09-0.97) for the VLT immediate recall, 0.94 (95% CI 0.68-0.98) for the VLT delayed recall, and 0.93 (95% CI 0.56-0.97) for the SVF. The full model including the total word count and speech and linguistic features had an area under the curve of 0.81 and 0.77 for the VLT immediate and delayed recall, respectively, and 0.61 for the SVF.

Conclusion: There was a high agreement between the ASR and manual scores, keeping the broad confidence intervals in mind. The phone-based VLT was able to differentiate between SCD and MCI and can have opportunities for clinical trial screening.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

半自动电话评估中认知任务的自动语音分析的验证。

引言：我们在记忆诊所人群的半自动电话评估中，通过将ASR分数与语言学习测试（VLT）和语义语言流利性（SVF）任务的手动分数进行比较，研究了自动语音识别（ASR）软件的准确性。此外，我们还检验了这些测试在主观认知能力下降（SCD）和轻度认知障碍（MCI）参与者之间的区分价值。我们还调查了在半自动电话评估中，与常用的总分相比，自动计算的语音和语言特征是否具有附加值。方法：我们纳入了来自马斯特里赫特大学医学中心+记忆诊所的94名参与者（SCD N=56，MCI N=38）。测试负责人指导参与者进行半自动电话评估。VLT和SVF通过移动应用程序进行音频记录和处理。自动提取回忆次数以及语音和语言特征。通过训练机器学习分类器对诊断组进行分类，以区分SCD和MCI参与者。结果：对于VLT即时回忆，手册和ASR总字数之间的评分者间信度的组内相关性为0.89（95%CI 0.09-0.97），对于VLT延迟回忆为0.94（95%CI 0.68-0.98），对于SVF为0.93（95%CI 0.56-0.97）。包括总字数、语音和语言特征在内的完整模型的VLT立即回忆和延迟回忆的曲线下面积分别为0.81和0.77，SVF的曲线下区域为0.61。结论：ASR和手动评分之间有很高的一致性，记住了广泛的置信区间。基于手机的VLT能够区分SCD和MCI，并有机会进行临床试验筛查。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊