Mara Barberis, Pieter De Clercq, Bastiaan Tamm, Hugo Van hamme, Maaike Vandermosten
{"title":"自动识别和检测失语自然语音","authors":"Mara Barberis, Pieter De Clercq, Bastiaan Tamm, Hugo Van hamme, Maaike Vandermosten","doi":"arxiv-2408.14082","DOIUrl":null,"url":null,"abstract":"Aphasia is a language disorder affecting one third of stroke patients.\nCurrent aphasia assessment does not consider natural speech due to the time\nconsuming nature of manual transcriptions and a lack of knowledge on how to\nanalyze such data. Here, we evaluate the potential of automatic speech\nrecognition (ASR) to transcribe Dutch aphasic speech and the ability of natural\nspeech features to detect aphasia. A picture-description task was administered\nand automatically transcribed in 62 persons with aphasia and 57 controls.\nAcoustic and linguistic features were semi-automatically extracted and provided\nas input to a support vector machine (SVM) classifier. Our ASR model obtained a\nWER of 24.5%, outperforming earlier ASR models for aphasia. The SVM shows high\naccuracy (86.6%) at the individual level, with fluency features as most\ndominant to detect aphasia. ASR and semi-automatic feature extraction can thus\nfacilitate natural speech analysis in a time efficient manner in clinical\npractice.","PeriodicalId":501517,"journal":{"name":"arXiv - QuanBio - Neurons and Cognition","volume":"269 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automatic recognition and detection of aphasic natural speech\",\"authors\":\"Mara Barberis, Pieter De Clercq, Bastiaan Tamm, Hugo Van hamme, Maaike Vandermosten\",\"doi\":\"arxiv-2408.14082\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Aphasia is a language disorder affecting one third of stroke patients.\\nCurrent aphasia assessment does not consider natural speech due to the time\\nconsuming nature of manual transcriptions and a lack of knowledge on how to\\nanalyze such data. Here, we evaluate the potential of automatic speech\\nrecognition (ASR) to transcribe Dutch aphasic speech and the ability of natural\\nspeech features to detect aphasia. A picture-description task was administered\\nand automatically transcribed in 62 persons with aphasia and 57 controls.\\nAcoustic and linguistic features were semi-automatically extracted and provided\\nas input to a support vector machine (SVM) classifier. Our ASR model obtained a\\nWER of 24.5%, outperforming earlier ASR models for aphasia. The SVM shows high\\naccuracy (86.6%) at the individual level, with fluency features as most\\ndominant to detect aphasia. ASR and semi-automatic feature extraction can thus\\nfacilitate natural speech analysis in a time efficient manner in clinical\\npractice.\",\"PeriodicalId\":501517,\"journal\":{\"name\":\"arXiv - QuanBio - Neurons and Cognition\",\"volume\":\"269 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - QuanBio - Neurons and Cognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.14082\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Neurons and Cognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.14082","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
失语症是一种语言障碍,影响着三分之一的中风患者。目前的失语症评估并不考虑自然语音,原因是人工转录耗时,而且缺乏如何分析此类数据的知识。在此,我们评估了自动语音识别(ASR)转录荷兰语失语语音的潜力以及自然语音特征检测失语症的能力。我们对 62 名失语症患者和 57 名对照组患者的图片描述任务进行了自动转录,并半自动提取了声学和语言特征,作为支持向量机 (SVM) 分类器的输入。我们的 ASR 模型的误码率为 24.5%,优于早期的失语症 ASR 模型。SVM 在个体水平上显示出较高的准确率(86.6%),其中流利度特征是检测失语症的最主要特征。因此,在临床实践中,ASR 和半自动特征提取能够以省时省力的方式促进自然语音分析。
Automatic recognition and detection of aphasic natural speech
Aphasia is a language disorder affecting one third of stroke patients.
Current aphasia assessment does not consider natural speech due to the time
consuming nature of manual transcriptions and a lack of knowledge on how to
analyze such data. Here, we evaluate the potential of automatic speech
recognition (ASR) to transcribe Dutch aphasic speech and the ability of natural
speech features to detect aphasia. A picture-description task was administered
and automatically transcribed in 62 persons with aphasia and 57 controls.
Acoustic and linguistic features were semi-automatically extracted and provided
as input to a support vector machine (SVM) classifier. Our ASR model obtained a
WER of 24.5%, outperforming earlier ASR models for aphasia. The SVM shows high
accuracy (86.6%) at the individual level, with fluency features as most
dominant to detect aphasia. ASR and semi-automatic feature extraction can thus
facilitate natural speech analysis in a time efficient manner in clinical
practice.