Validation of an Automated Speech Analysis of Cognitive Tasks within a Semiautomated Phone Assessment.

Q1 Computer Science Digital Biomarkers Pub Date : 2023-08-31 eCollection Date: 2023-01-01 DOI:10.1159/000533188
Daphne Ter Huurne, Nina Possemis, Leonie Banning, Angélique Gruters, Alexandra König, Nicklas Linz, Johannes Tröger, Kai Langel, Frans Verhey, Marjolein de Vugt, Inez Ramakers
{"title":"Validation of an Automated Speech Analysis of Cognitive Tasks within a Semiautomated Phone Assessment.","authors":"Daphne Ter Huurne, Nina Possemis, Leonie Banning, Angélique Gruters, Alexandra König, Nicklas Linz, Johannes Tröger, Kai Langel, Frans Verhey, Marjolein de Vugt, Inez Ramakers","doi":"10.1159/000533188","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>We studied the accuracy of the automatic speech recognition (ASR) software by comparing ASR scores with manual scores from a verbal learning test (VLT) and a semantic verbal fluency (SVF) task in a semiautomated phone assessment in a memory clinic population. Furthermore, we examined the differentiating value of these tests between participants with subjective cognitive decline (SCD) and mild cognitive impairment (MCI). We also investigated whether the automatically calculated speech and linguistic features had an additional value compared to the commonly used total scores in a semiautomated phone assessment.</p><p><strong>Methods: </strong>We included 94 participants from the memory clinic of the Maastricht University Medical Center+ (SCD <i>N</i> = 56 and MCI <i>N</i> = 38). The test leader guided the participant through a semiautomated phone assessment. The VLT and SVF were audio recorded and processed via a mobile application. The recall count and speech and linguistic features were automatically extracted. The diagnostic groups were classified by training machine learning classifiers to differentiate SCD and MCI participants.</p><p><strong>Results: </strong>The intraclass correlation for inter-rater reliability between the manual and the ASR total word count was 0.89 (95% CI 0.09-0.97) for the VLT immediate recall, 0.94 (95% CI 0.68-0.98) for the VLT delayed recall, and 0.93 (95% CI 0.56-0.97) for the SVF. The full model including the total word count and speech and linguistic features had an area under the curve of 0.81 and 0.77 for the VLT immediate and delayed recall, respectively, and 0.61 for the SVF.</p><p><strong>Conclusion: </strong>There was a high agreement between the ASR and manual scores, keeping the broad confidence intervals in mind. The phone-based VLT was able to differentiate between SCD and MCI and can have opportunities for clinical trial screening.</p>","PeriodicalId":11242,"journal":{"name":"Digital Biomarkers","volume":"7 1","pages":"115-123"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10601928/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Biomarkers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1159/000533188","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: We studied the accuracy of the automatic speech recognition (ASR) software by comparing ASR scores with manual scores from a verbal learning test (VLT) and a semantic verbal fluency (SVF) task in a semiautomated phone assessment in a memory clinic population. Furthermore, we examined the differentiating value of these tests between participants with subjective cognitive decline (SCD) and mild cognitive impairment (MCI). We also investigated whether the automatically calculated speech and linguistic features had an additional value compared to the commonly used total scores in a semiautomated phone assessment.

Methods: We included 94 participants from the memory clinic of the Maastricht University Medical Center+ (SCD N = 56 and MCI N = 38). The test leader guided the participant through a semiautomated phone assessment. The VLT and SVF were audio recorded and processed via a mobile application. The recall count and speech and linguistic features were automatically extracted. The diagnostic groups were classified by training machine learning classifiers to differentiate SCD and MCI participants.

Results: The intraclass correlation for inter-rater reliability between the manual and the ASR total word count was 0.89 (95% CI 0.09-0.97) for the VLT immediate recall, 0.94 (95% CI 0.68-0.98) for the VLT delayed recall, and 0.93 (95% CI 0.56-0.97) for the SVF. The full model including the total word count and speech and linguistic features had an area under the curve of 0.81 and 0.77 for the VLT immediate and delayed recall, respectively, and 0.61 for the SVF.

Conclusion: There was a high agreement between the ASR and manual scores, keeping the broad confidence intervals in mind. The phone-based VLT was able to differentiate between SCD and MCI and can have opportunities for clinical trial screening.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
半自动电话评估中认知任务的自动语音分析的验证。
引言:我们在记忆诊所人群的半自动电话评估中,通过将ASR分数与语言学习测试(VLT)和语义语言流利性(SVF)任务的手动分数进行比较,研究了自动语音识别(ASR)软件的准确性。此外,我们还检验了这些测试在主观认知能力下降(SCD)和轻度认知障碍(MCI)参与者之间的区分价值。我们还调查了在半自动电话评估中,与常用的总分相比,自动计算的语音和语言特征是否具有附加值。方法:我们纳入了来自马斯特里赫特大学医学中心+记忆诊所的94名参与者(SCD N=56,MCI N=38)。测试负责人指导参与者进行半自动电话评估。VLT和SVF通过移动应用程序进行音频记录和处理。自动提取回忆次数以及语音和语言特征。通过训练机器学习分类器对诊断组进行分类,以区分SCD和MCI参与者。结果:对于VLT即时回忆,手册和ASR总字数之间的评分者间信度的组内相关性为0.89(95%CI 0.09-0.97),对于VLT延迟回忆为0.94(95%CI 0.68-0.98),对于SVF为0.93(95%CI 0.56-0.97)。包括总字数、语音和语言特征在内的完整模型的VLT立即回忆和延迟回忆的曲线下面积分别为0.81和0.77,SVF的曲线下区域为0.61。结论:ASR和手动评分之间有很高的一致性,记住了广泛的置信区间。基于手机的VLT能够区分SCD和MCI,并有机会进行临床试验筛查。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Digital Biomarkers
Digital Biomarkers Medicine-Medicine (miscellaneous)
CiteScore
10.60
自引率
0.00%
发文量
12
审稿时长
23 weeks
期刊最新文献
The Imperative of Voice Data Collection in Clinical Trials. eHealth and mHealth in Antimicrobial Stewardship Programs. Detecting Longitudinal Trends between Passively Collected Phone Use and Anxiety among College Students. Video Assessment to Detect Amyotrophic Lateral Sclerosis. Digital Vocal Biomarker of Smoking Status Using Ecological Audio Recordings: Results from the Colive Voice Study.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1