{"title":"不同声音参数下说话人区别功率不对称性的评估","authors":"Julio Cesar Cavalcanti, A. Eriksson, P. Barbosa","doi":"10.21437/isaph.2022-2","DOIUrl":null,"url":null,"abstract":"This pilot study set out to assess the speaker discriminatory power asymmetry regarding parameters from different phonetic dimensions in spontaneous speech, i.e., spectral, melodic, and temporal. The speech material consisted of spontaneous telephone conversations between siblings. The participants were 20 male subjects, Brazilian Portuguese speakers from the same dialectal area. Six acoustic-phonetic parameters were chosen for the comparison: f0 median, f0 baseline, speech rate, articulation rate, F3, and F4. Overall, acoustic parameters pertaining to the speech tempo category depicted the worse performance in terms of speaker discriminatory power when assessed in isolation. Such a trend was indicated by the relatively higher median and mean Cllr and EER values. Moreover, from the set of parameters assessed, high formant frequencies, i.e., F3 and F4, were the best-performing estimates in terms of discriminability depicting the lowest EER and Cllr values. The results suggested a speaker discriminatory power asymmetry concerning different acoustic-phonetic parameters, in which speech tempo estimates presented a lower discriminatory power when compared to melodic and spectral parameters. The findings also suggest that data sampling is crucial for the reliability of Cllr and EER calculations.","PeriodicalId":406640,"journal":{"name":"ISAPh 2022, 4th International Symposium on Applied Phonetics","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Assessing the speaker discriminatory power asymmetry of different acoustic-phonetic parameters\",\"authors\":\"Julio Cesar Cavalcanti, A. Eriksson, P. Barbosa\",\"doi\":\"10.21437/isaph.2022-2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This pilot study set out to assess the speaker discriminatory power asymmetry regarding parameters from different phonetic dimensions in spontaneous speech, i.e., spectral, melodic, and temporal. The speech material consisted of spontaneous telephone conversations between siblings. The participants were 20 male subjects, Brazilian Portuguese speakers from the same dialectal area. Six acoustic-phonetic parameters were chosen for the comparison: f0 median, f0 baseline, speech rate, articulation rate, F3, and F4. Overall, acoustic parameters pertaining to the speech tempo category depicted the worse performance in terms of speaker discriminatory power when assessed in isolation. Such a trend was indicated by the relatively higher median and mean Cllr and EER values. Moreover, from the set of parameters assessed, high formant frequencies, i.e., F3 and F4, were the best-performing estimates in terms of discriminability depicting the lowest EER and Cllr values. The results suggested a speaker discriminatory power asymmetry concerning different acoustic-phonetic parameters, in which speech tempo estimates presented a lower discriminatory power when compared to melodic and spectral parameters. The findings also suggest that data sampling is crucial for the reliability of Cllr and EER calculations.\",\"PeriodicalId\":406640,\"journal\":{\"name\":\"ISAPh 2022, 4th International Symposium on Applied Phonetics\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ISAPh 2022, 4th International Symposium on Applied Phonetics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21437/isaph.2022-2\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISAPh 2022, 4th International Symposium on Applied Phonetics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21437/isaph.2022-2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Assessing the speaker discriminatory power asymmetry of different acoustic-phonetic parameters
This pilot study set out to assess the speaker discriminatory power asymmetry regarding parameters from different phonetic dimensions in spontaneous speech, i.e., spectral, melodic, and temporal. The speech material consisted of spontaneous telephone conversations between siblings. The participants were 20 male subjects, Brazilian Portuguese speakers from the same dialectal area. Six acoustic-phonetic parameters were chosen for the comparison: f0 median, f0 baseline, speech rate, articulation rate, F3, and F4. Overall, acoustic parameters pertaining to the speech tempo category depicted the worse performance in terms of speaker discriminatory power when assessed in isolation. Such a trend was indicated by the relatively higher median and mean Cllr and EER values. Moreover, from the set of parameters assessed, high formant frequencies, i.e., F3 and F4, were the best-performing estimates in terms of discriminability depicting the lowest EER and Cllr values. The results suggested a speaker discriminatory power asymmetry concerning different acoustic-phonetic parameters, in which speech tempo estimates presented a lower discriminatory power when compared to melodic and spectral parameters. The findings also suggest that data sampling is crucial for the reliability of Cllr and EER calculations.