Voicesauce:语音分析程序

Proceedings of the ... International Congress of Phonetic Sciences. International Congress of Phonetic Sciences Pub Date : 2009-10-06 DOI:10.1121/1.3248865

Yen-Liang Shue, P. Keating, C. Vicenik, Kristine M. Yu

{"title":"Voicesauce:语音分析程序","authors":"Yen-Liang Shue, P. Keating, C. Vicenik, Kristine M. Yu","doi":"10.1121/1.3248865","DOIUrl":null,"url":null,"abstract":"VOICESAUCE is a new application, implemented in MATLAB, which provides automated voice measurements over time from audio recordings. The measures currently computed are F0, H1(*), H2(*), H4(*), H1(*)‐H2(*), H2(*)‐H4(*), H1(*)‐A1, H1(*)‐A2, H1(*)‐A3, energy, Cepstral Peak Prominence, F1–F4, and B1–B4, where (*) indicates that harmonic amplitudes are reported with and without corrections for formant frequencies and bandwidths [Iseli et al. (2006)]. Formant values are calculated using the Snack Sound Toolkit, while F0 is calculated using the STRAIGHT algorithm; harmonic spectra magnitudes are computed pitch‐synchronously. VOICESAUCE takes as input a folder of wav files, and for each input wav file produces a MATLAB file with values every millsecond for all measures. It can operate over the whole input file or over segments delimited by a PRAAT textgrid file. VOICESAUCE then takes these MATLAB outputs, optionally along with electroglottographic measurements obtained separately from PCQUIRERX, and provides con...","PeriodicalId":74531,"journal":{"name":"Proceedings of the ... International Congress of Phonetic Sciences. International Congress of Phonetic Sciences","volume":"137 1","pages":"1846-1849"},"PeriodicalIF":0.0000,"publicationDate":"2009-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"355","resultStr":"{\"title\":\"Voicesauce: A Program for Voice Analysis\",\"authors\":\"Yen-Liang Shue, P. Keating, C. Vicenik, Kristine M. Yu\",\"doi\":\"10.1121/1.3248865\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"VOICESAUCE is a new application, implemented in MATLAB, which provides automated voice measurements over time from audio recordings. The measures currently computed are F0, H1(*), H2(*), H4(*), H1(*)‐H2(*), H2(*)‐H4(*), H1(*)‐A1, H1(*)‐A2, H1(*)‐A3, energy, Cepstral Peak Prominence, F1–F4, and B1–B4, where (*) indicates that harmonic amplitudes are reported with and without corrections for formant frequencies and bandwidths [Iseli et al. (2006)]. Formant values are calculated using the Snack Sound Toolkit, while F0 is calculated using the STRAIGHT algorithm; harmonic spectra magnitudes are computed pitch‐synchronously. VOICESAUCE takes as input a folder of wav files, and for each input wav file produces a MATLAB file with values every millsecond for all measures. It can operate over the whole input file or over segments delimited by a PRAAT textgrid file. VOICESAUCE then takes these MATLAB outputs, optionally along with electroglottographic measurements obtained separately from PCQUIRERX, and provides con...\",\"PeriodicalId\":74531,\"journal\":{\"name\":\"Proceedings of the ... International Congress of Phonetic Sciences. International Congress of Phonetic Sciences\",\"volume\":\"137 1\",\"pages\":\"1846-1849\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"355\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the ... International Congress of Phonetic Sciences. International Congress of Phonetic Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1121/1.3248865\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... International Congress of Phonetic Sciences. International Congress of Phonetic Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1121/1.3248865","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 355

摘要

VOICESAUCE是一个在MATLAB中实现的新应用程序，它可以根据录音提供随时间的自动语音测量。目前计算的测量值为F0, H1(*)， H2(*)， H4(*)， H1(*)‐H2(*)， H2(*)‐H4(*)， H1(*)‐A1, H1(*)‐A2, H1(*)‐A3，能量，倒频峰突出，F1-F4和B1-B4，其中(*)表示谐波幅度报告有或没有对形成峰频率和带宽进行修正[Iseli等人(2006)]。使用小吃声音工具包计算形成峰值，而使用STRAIGHT算法计算F0;谐波谱的幅度是同步计算的。VOICESAUCE将wav文件文件夹作为输入，并为每个输入wav文件生成一个MATLAB文件，其值每毫秒用于所有测量。它可以对整个输入文件或由PRAAT textgrid文件分隔的段进行操作。VOICESAUCE然后采取这些MATLAB输出，可选地与从PCQUIRERX单独获得的声门电测量一起，并提供…

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Voicesauce: A Program for Voice Analysis

VOICESAUCE is a new application, implemented in MATLAB, which provides automated voice measurements over time from audio recordings. The measures currently computed are F0, H1(*), H2(*), H4(*), H1(*)‐H2(*), H2(*)‐H4(*), H1(*)‐A1, H1(*)‐A2, H1(*)‐A3, energy, Cepstral Peak Prominence, F1–F4, and B1–B4, where (*) indicates that harmonic amplitudes are reported with and without corrections for formant frequencies and bandwidths [Iseli et al. (2006)]. Formant values are calculated using the Snack Sound Toolkit, while F0 is calculated using the STRAIGHT algorithm; harmonic spectra magnitudes are computed pitch‐synchronously. VOICESAUCE takes as input a folder of wav files, and for each input wav file produces a MATLAB file with values every millsecond for all measures. It can operate over the whole input file or over segments delimited by a PRAAT textgrid file. VOICESAUCE then takes these MATLAB outputs, optionally along with electroglottographic measurements obtained separately from PCQUIRERX, and provides con...

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the ... International Congress of Phonetic Sciences. International Congress of Phonetic Sciences

自引率

0.00%

发文量