不同声音参数下说话人区别功率不对称性的评估

ISAPh 2022, 4th International Symposium on Applied Phonetics Pub Date : 2022-09-14 DOI:10.21437/isaph.2022-2

Julio Cesar Cavalcanti, A. Eriksson, P. Barbosa

{"title":"不同声音参数下说话人区别功率不对称性的评估","authors":"Julio Cesar Cavalcanti, A. Eriksson, P. Barbosa","doi":"10.21437/isaph.2022-2","DOIUrl":null,"url":null,"abstract":"This pilot study set out to assess the speaker discriminatory power asymmetry regarding parameters from different phonetic dimensions in spontaneous speech, i.e., spectral, melodic, and temporal. The speech material consisted of spontaneous telephone conversations between siblings. The participants were 20 male subjects, Brazilian Portuguese speakers from the same dialectal area. Six acoustic-phonetic parameters were chosen for the comparison: f0 median, f0 baseline, speech rate, articulation rate, F3, and F4. Overall, acoustic parameters pertaining to the speech tempo category depicted the worse performance in terms of speaker discriminatory power when assessed in isolation. Such a trend was indicated by the relatively higher median and mean Cllr and EER values. Moreover, from the set of parameters assessed, high formant frequencies, i.e., F3 and F4, were the best-performing estimates in terms of discriminability depicting the lowest EER and Cllr values. The results suggested a speaker discriminatory power asymmetry concerning different acoustic-phonetic parameters, in which speech tempo estimates presented a lower discriminatory power when compared to melodic and spectral parameters. The findings also suggest that data sampling is crucial for the reliability of Cllr and EER calculations.","PeriodicalId":406640,"journal":{"name":"ISAPh 2022, 4th International Symposium on Applied Phonetics","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Assessing the speaker discriminatory power asymmetry of different acoustic-phonetic parameters\",\"authors\":\"Julio Cesar Cavalcanti, A. Eriksson, P. Barbosa\",\"doi\":\"10.21437/isaph.2022-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This pilot study set out to assess the speaker discriminatory power asymmetry regarding parameters from different phonetic dimensions in spontaneous speech, i.e., spectral, melodic, and temporal. The speech material consisted of spontaneous telephone conversations between siblings. The participants were 20 male subjects, Brazilian Portuguese speakers from the same dialectal area. Six acoustic-phonetic parameters were chosen for the comparison: f0 median, f0 baseline, speech rate, articulation rate, F3, and F4. Overall, acoustic parameters pertaining to the speech tempo category depicted the worse performance in terms of speaker discriminatory power when assessed in isolation. Such a trend was indicated by the relatively higher median and mean Cllr and EER values. Moreover, from the set of parameters assessed, high formant frequencies, i.e., F3 and F4, were the best-performing estimates in terms of discriminability depicting the lowest EER and Cllr values. The results suggested a speaker discriminatory power asymmetry concerning different acoustic-phonetic parameters, in which speech tempo estimates presented a lower discriminatory power when compared to melodic and spectral parameters. The findings also suggest that data sampling is crucial for the reliability of Cllr and EER calculations.\",\"PeriodicalId\":406640,\"journal\":{\"name\":\"ISAPh 2022, 4th International Symposium on Applied Phonetics\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ISAPh 2022, 4th International Symposium on Applied Phonetics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21437/isaph.2022-2\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISAPh 2022, 4th International Symposium on Applied Phonetics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21437/isaph.2022-2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本初步研究旨在评估自发语音中不同语音维度(即频谱、旋律和时间)参数的说话人歧视性权力不对称。演讲材料包括兄弟姐妹之间自发的电话交谈。参与者是来自同一方言地区的20名说巴西葡萄牙语的男性。选择6个声学-语音参数进行比较:f0中位数、f0基线、语速、发音率、F3和F4。总体而言，当单独评估时，与语音节奏类别相关的声学参数在说话者区分能力方面表现较差。Cllr和EER的中位数和平均值相对较高表明了这种趋势。此外，从评估的一组参数来看，高形成峰频率，即F3和F4，在描述最低EER和Cllr值的可判别性方面是表现最好的估计。结果表明，在不同的声音参数下，说话人的区分能力不对称，其中语速估计比旋律和频谱参数具有更低的区分能力。研究结果还表明，数据采样对于Cllr和EER计算的可靠性至关重要。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Assessing the speaker discriminatory power asymmetry of different acoustic-phonetic parameters

This pilot study set out to assess the speaker discriminatory power asymmetry regarding parameters from different phonetic dimensions in spontaneous speech, i.e., spectral, melodic, and temporal. The speech material consisted of spontaneous telephone conversations between siblings. The participants were 20 male subjects, Brazilian Portuguese speakers from the same dialectal area. Six acoustic-phonetic parameters were chosen for the comparison: f0 median, f0 baseline, speech rate, articulation rate, F3, and F4. Overall, acoustic parameters pertaining to the speech tempo category depicted the worse performance in terms of speaker discriminatory power when assessed in isolation. Such a trend was indicated by the relatively higher median and mean Cllr and EER values. Moreover, from the set of parameters assessed, high formant frequencies, i.e., F3 and F4, were the best-performing estimates in terms of discriminability depicting the lowest EER and Cllr values. The results suggested a speaker discriminatory power asymmetry concerning different acoustic-phonetic parameters, in which speech tempo estimates presented a lower discriminatory power when compared to melodic and spectral parameters. The findings also suggest that data sampling is crucial for the reliability of Cllr and EER calculations.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

ISAPh 2022, 4th International Symposium on Applied Phonetics

自引率

0.00%

发文量