Evaluating acoustic representations and normalization for rhoticity classification in children with speech sound disorders.

IF 1.2 Q3 ACOUSTICS JASA express letters Pub Date : 2024-02-01 DOI:10.1121/10.0024632
Nina R Benway, Jonathan L Preston, Asif Salekin, Elaine Hitchcock, Tara McAllister
{"title":"Evaluating acoustic representations and normalization for rhoticity classification in children with speech sound disorders.","authors":"Nina R Benway, Jonathan L Preston, Asif Salekin, Elaine Hitchcock, Tara McAllister","doi":"10.1121/10.0024632","DOIUrl":null,"url":null,"abstract":"<p><p>The effects of different acoustic representations and normalizations were compared for classifiers predicting perception of children's rhotic versus derhotic /ɹ/. Formant and Mel frequency cepstral coefficient (MFCC) representations for 350 speakers were z-standardized, either relative to values in the same utterance or age-and-sex data for typical /ɹ/. Statistical modeling indicated age-and-sex normalization significantly increased classifier performances. Clinically interpretable formants performed similarly to MFCCs and were endorsed for deep neural network engineering, achieving mean test-participant-specific F1-score = 0.81 after personalization and replication (σx = 0.10, med = 0.83, n = 48). Shapley additive explanations analysis indicated the third formant most influenced fully rhotic predictions.</p>","PeriodicalId":73538,"journal":{"name":"JASA express letters","volume":null,"pages":null},"PeriodicalIF":1.2000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JASA express letters","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1121/10.0024632","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0

Abstract

The effects of different acoustic representations and normalizations were compared for classifiers predicting perception of children's rhotic versus derhotic /ɹ/. Formant and Mel frequency cepstral coefficient (MFCC) representations for 350 speakers were z-standardized, either relative to values in the same utterance or age-and-sex data for typical /ɹ/. Statistical modeling indicated age-and-sex normalization significantly increased classifier performances. Clinically interpretable formants performed similarly to MFCCs and were endorsed for deep neural network engineering, achieving mean test-participant-specific F1-score = 0.81 after personalization and replication (σx = 0.10, med = 0.83, n = 48). Shapley additive explanations analysis indicated the third formant most influenced fully rhotic predictions.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
评估声学表征和归一化对语言发音障碍儿童的翘舌音分类。
我们比较了不同声学表征和归一化对预测儿童发音/ɹ/的分类器的影响。对 350 名说话者的声形和梅尔频率倒谱系数(MFCC)表征进行了 z 标准化,或者相对于同一语料中的值,或者相对于典型 /ɹ/ 的年龄和性别数据。统计建模表明,年龄和性别标准化显著提高了分类器的性能。临床可解释声母的表现与 MFCC 相似,并得到了深度神经网络工程的认可,在个性化和复制后,测试参与者特定的平均 F1 分数 = 0.81(σx = 0.10,中间值 = 0.83,n = 48)。夏普利加法解释分析表明,第三声母对完全斜音预测的影响最大。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
1.70
自引率
0.00%
发文量
0
期刊最新文献
Effect of hearing aids on the externalization of everyday sounds. Tests of human auditory temporal resolution: Simulations of Bayesian threshold estimation for auditory gap detection. Hearing aid evaluation for music: Accounting for acoustical variability of music stimuli. Minima in cubic distortion-product otoacoustic emission input/output functions due to distributed primary sources. Investigating muscle coordination patterns with Granger causality analysis in protrusive motion from tagged and diffusion MRI.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1