Utilization of Machine Learning-Based Computer Vision and Voice Analysis to Derive Digital Biomarkers of Cognitive Functioning in Trauma Survivors.

Q1 Computer Science Digital Biomarkers Pub Date : 2020-12-30 eCollection Date: 2021-01-01 DOI:10.1159/000512394

Katharina Schultebraucks, Vijay Yadav, Isaac R Galatzer-Levy

{"title":"Utilization of Machine Learning-Based Computer Vision and Voice Analysis to Derive Digital Biomarkers of Cognitive Functioning in Trauma Survivors.","authors":"Katharina Schultebraucks, Vijay Yadav, Isaac R Galatzer-Levy","doi":"10.1159/000512394","DOIUrl":null,"url":null,"abstract":"Background: Alterations in multiple domains of cognition have been observed in individuals who have experienced a traumatic stressor. These domains may provide important insights in identifying underlying neurobiological dysfunction driving an individual's clinical response to trauma. However, such assessments are burdensome, costly, and time-consuming. To overcome barriers, efforts have emerged to measure multiple domains of cognitive functioning through the application of machine learning (ML) models to passive data sources.Methods: We utilized automated computer vision and voice analysis methods to extract facial, movement, and speech characteristics from semi-structured clinical interviews in 81 trauma survivors who additionally completed a cognitive assessment battery. A ML-based regression framework was used to identify variance in visual and auditory measures that relate to multiple cognitive domains.Results: Models derived from visual and auditory measures collectively accounted for a large variance in multiple domains of cognitive functioning, including motor coordination (R2 = 0.52), processing speed (R2 = 0.42), emotional bias (R2 = 0.52), sustained attention (R2 = 0.51), controlled attention (R2 = 0.44), cognitive flexibility (R2 = 0.43), cognitive inhibition (R2 = 0.64), and executive functioning (R2 = 0.63), consistent with the high test-retest reliability of traditional cognitive assessments. Face, voice, speech content, and movement have all significantly contributed to explaining the variance in predicting functioning in all cognitive domains.Conclusions: The results demonstrate the feasibility of automated measurement of reliable proxies of cognitive functioning through low-burden passive patient evaluations. This makes it easier to monitor cognitive functions and to intervene earlier and at a lower threshold without requiring a time-consuming neurocognitive assessment by, for instance, a licensed psychologist with specialized training in neuropsychology.","PeriodicalId":11242,"journal":{"name":"Digital Biomarkers","volume":"5 1","pages":"16-23"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1159/000512394","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Biomarkers","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1159/000512394","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2021/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 11

Abstract

Background: Alterations in multiple domains of cognition have been observed in individuals who have experienced a traumatic stressor. These domains may provide important insights in identifying underlying neurobiological dysfunction driving an individual's clinical response to trauma. However, such assessments are burdensome, costly, and time-consuming. To overcome barriers, efforts have emerged to measure multiple domains of cognitive functioning through the application of machine learning (ML) models to passive data sources.

Methods: We utilized automated computer vision and voice analysis methods to extract facial, movement, and speech characteristics from semi-structured clinical interviews in 81 trauma survivors who additionally completed a cognitive assessment battery. A ML-based regression framework was used to identify variance in visual and auditory measures that relate to multiple cognitive domains.

Results: Models derived from visual and auditory measures collectively accounted for a large variance in multiple domains of cognitive functioning, including motor coordination (R² = 0.52), processing speed (R² = 0.42), emotional bias (R² = 0.52), sustained attention (R² = 0.51), controlled attention (R² = 0.44), cognitive flexibility (R² = 0.43), cognitive inhibition (R² = 0.64), and executive functioning (R² = 0.63), consistent with the high test-retest reliability of traditional cognitive assessments. Face, voice, speech content, and movement have all significantly contributed to explaining the variance in predicting functioning in all cognitive domains.

Conclusions: The results demonstrate the feasibility of automated measurement of reliable proxies of cognitive functioning through low-burden passive patient evaluations. This makes it easier to monitor cognitive functions and to intervene earlier and at a lower threshold without requiring a time-consuming neurocognitive assessment by, for instance, a licensed psychologist with specialized training in neuropsychology.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用基于机器学习的计算机视觉和语音分析来获得创伤幸存者认知功能的数字生物标志物。

背景:在经历过创伤性应激源的个体中，已经观察到多个认知领域的改变。这些领域可能为识别驱动个体对创伤临床反应的潜在神经生物学功能障碍提供重要见解。然而，这样的评估是繁重的、昂贵的和耗时的。为了克服障碍，已经出现了通过将机器学习(ML)模型应用于被动数据源来测量认知功能多个领域的努力。方法:我们利用自动计算机视觉和语音分析方法从81名创伤幸存者的半结构化临床访谈中提取面部、运动和语言特征，这些幸存者还完成了认知评估电池。使用基于ml的回归框架来识别与多个认知领域相关的视觉和听觉测量的差异。结果:来自视觉和听觉测量的模型在多个认知功能领域，包括运动协调(R2 = 0.52)、处理速度(R2 = 0.42)、情绪偏见(R2 = 0.52)、持续注意力(R2 = 0.51)、控制注意力(R2 = 0.44)、认知灵活性(R2 = 0.43)、认知抑制(R2 = 0.64)和执行功能(R2 = 0.63)，都有很大的差异。与传统认知评估的高重测信度一致。面部、声音、言语内容和动作都对解释预测所有认知领域功能的差异有重要贡献。结论:研究结果表明，通过低负担的被动患者评估，自动测量可靠的认知功能指标是可行的。这使得监测认知功能更容易，更早、更低的阈值进行干预，而不需要耗时的神经认知评估，例如，由经过神经心理学专业培训的有执照的心理学家进行评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊