Voicesauce: A Program for Voice Analysis

Yen-Liang Shue, P. Keating, C. Vicenik, Kristine M. Yu
Proceedings of the ... International Congress of Phonetic Sciences, volume 137 1, pages 1846-1849. Published 2009-10-06.
DOI: 10.1121/1.3248865 (https://doi.org/10.1121/1.3248865)
Citations: 355

Abstract

VOICESAUCE is a new application, implemented in MATLAB, which provides automated voice measurements over time from audio recordings. The measures currently computed are F0, H1(*), H2(*), H4(*), H1(*)‐H2(*), H2(*)‐H4(*), H1(*)‐A1, H1(*)‐A2, H1(*)‐A3, energy, Cepstral Peak Prominence, F1–F4, and B1–B4, where (*) indicates that harmonic amplitudes are reported with and without corrections for formant frequencies and bandwidths [Iseli et al. (2006)]. Formant values are calculated using the Snack Sound Toolkit, while F0 is calculated using the STRAIGHT algorithm; harmonic spectra magnitudes are computed pitch‐synchronously. VOICESAUCE takes as input a folder of wav files, and for each input wav file produces a MATLAB file with values every millisecond for all measures. It can operate over the whole input file or over segments delimited by a PRAAT textgrid file. VOICESAUCE then takes these MATLAB outputs, optionally along with electroglottographic measurements obtained separately from PCQUIRERX, and provides con...
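To make the harmonic measures concrete, the following is a minimal sketch of an uncorrected H1‐H2 value for a single frame. It is not VOICESAUCE's code: the program itself is written in MATLAB, takes F0 from STRAIGHT, and applies the Iseli et al. (2006) correction for the starred measures, none of which is reproduced here. The function names `harmonic_db` and `h1_minus_h2`, the synthetic frame, and the assumption that F0 is already known and that the frame spans a whole number of pitch periods are all illustrative choices.

```python
import math


def harmonic_db(signal, sample_rate, freq):
    """Level in dB of one frequency component, via a single-bin DFT."""
    n = len(signal)
    re = sum(x * math.cos(2 * math.pi * freq * i / sample_rate)
             for i, x in enumerate(signal))
    im = sum(x * math.sin(2 * math.pi * freq * i / sample_rate)
             for i, x in enumerate(signal))
    # Scale so that a pure sinusoid of amplitude a yields a.
    amplitude = 2 * math.hypot(re, im) / n
    return 20 * math.log10(amplitude)


def h1_minus_h2(signal, sample_rate, f0):
    """Uncorrected H1-H2: level of the first harmonic minus the second."""
    h1 = harmonic_db(signal, sample_rate, f0)
    h2 = harmonic_db(signal, sample_rate, 2 * f0)
    return h1 - h2


# Hypothetical test frame: two harmonics with amplitudes 1.0 and 0.5,
# F0 = 200 Hz, 16 kHz sampling, exactly four pitch periods (320 samples).
sample_rate = 16000
f0 = 200.0
n_samples = 4 * int(sample_rate / f0)
frame = [
    1.0 * math.sin(2 * math.pi * f0 * i / sample_rate)
    + 0.5 * math.sin(2 * math.pi * 2 * f0 * i / sample_rate)
    for i in range(n_samples)
]

value = h1_minus_h2(frame, sample_rate, f0)
print(round(value, 2))  # prints 6.02
```

Because the second harmonic has half the amplitude of the first, the difference is 20·log10(2), about 6.02 dB. Choosing a frame that covers a whole number of pitch periods keeps the two harmonics from leaking into each other's estimates, which is one motivation for pitch‐synchronous analysis; real speech would also need an F0 estimate per frame and a check for silent frames, where the logarithm is undefined.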