Yen-Liang Shue, P. Keating, C. Vicenik, Kristine M. Yu
{"title":"Voicesauce:语音分析程序","authors":"Yen-Liang Shue, P. Keating, C. Vicenik, Kristine M. Yu","doi":"10.1121/1.3248865","DOIUrl":null,"url":null,"abstract":"VOICESAUCE is a new application, implemented in MATLAB, which provides automated voice measurements over time from audio recordings. The measures currently computed are F0, H1(*), H2(*), H4(*), H1(*)‐H2(*), H2(*)‐H4(*), H1(*)‐A1, H1(*)‐A2, H1(*)‐A3, energy, Cepstral Peak Prominence, F1–F4, and B1–B4, where (*) indicates that harmonic amplitudes are reported with and without corrections for formant frequencies and bandwidths [Iseli et al. (2006)]. Formant values are calculated using the Snack Sound Toolkit, while F0 is calculated using the STRAIGHT algorithm; harmonic spectra magnitudes are computed pitch‐synchronously. VOICESAUCE takes as input a folder of wav files, and for each input wav file produces a MATLAB file with values every millsecond for all measures. It can operate over the whole input file or over segments delimited by a PRAAT textgrid file. VOICESAUCE then takes these MATLAB outputs, optionally along with electroglottographic measurements obtained separately from PCQUIRERX, and provides con...","PeriodicalId":74531,"journal":{"name":"Proceedings of the ... International Congress of Phonetic Sciences. International Congress of Phonetic Sciences","volume":"137 1","pages":"1846-1849"},"PeriodicalIF":0.0000,"publicationDate":"2009-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"355","resultStr":"{\"title\":\"Voicesauce: A Program for Voice Analysis\",\"authors\":\"Yen-Liang Shue, P. Keating, C. Vicenik, Kristine M. Yu\",\"doi\":\"10.1121/1.3248865\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"VOICESAUCE is a new application, implemented in MATLAB, which provides automated voice measurements over time from audio recordings. The measures currently computed are F0, H1(*), H2(*), H4(*), H1(*)‐H2(*), H2(*)‐H4(*), H1(*)‐A1, H1(*)‐A2, H1(*)‐A3, energy, Cepstral Peak Prominence, F1–F4, and B1–B4, where (*) indicates that harmonic amplitudes are reported with and without corrections for formant frequencies and bandwidths [Iseli et al. (2006)]. Formant values are calculated using the Snack Sound Toolkit, while F0 is calculated using the STRAIGHT algorithm; harmonic spectra magnitudes are computed pitch‐synchronously. VOICESAUCE takes as input a folder of wav files, and for each input wav file produces a MATLAB file with values every millsecond for all measures. It can operate over the whole input file or over segments delimited by a PRAAT textgrid file. VOICESAUCE then takes these MATLAB outputs, optionally along with electroglottographic measurements obtained separately from PCQUIRERX, and provides con...\",\"PeriodicalId\":74531,\"journal\":{\"name\":\"Proceedings of the ... International Congress of Phonetic Sciences. International Congress of Phonetic Sciences\",\"volume\":\"137 1\",\"pages\":\"1846-1849\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"355\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the ... International Congress of Phonetic Sciences. International Congress of Phonetic Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1121/1.3248865\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... International Congress of Phonetic Sciences. International Congress of Phonetic Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1121/1.3248865","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
VOICESAUCE is a new application, implemented in MATLAB, which provides automated voice measurements over time from audio recordings. The measures currently computed are F0, H1(*), H2(*), H4(*), H1(*)‐H2(*), H2(*)‐H4(*), H1(*)‐A1, H1(*)‐A2, H1(*)‐A3, energy, Cepstral Peak Prominence, F1–F4, and B1–B4, where (*) indicates that harmonic amplitudes are reported with and without corrections for formant frequencies and bandwidths [Iseli et al. (2006)]. Formant values are calculated using the Snack Sound Toolkit, while F0 is calculated using the STRAIGHT algorithm; harmonic spectra magnitudes are computed pitch‐synchronously. VOICESAUCE takes as input a folder of wav files, and for each input wav file produces a MATLAB file with values every millsecond for all measures. It can operate over the whole input file or over segments delimited by a PRAAT textgrid file. VOICESAUCE then takes these MATLAB outputs, optionally along with electroglottographic measurements obtained separately from PCQUIRERX, and provides con...