Voicesauce: A Program for Voice Analysis

Yen-Liang Shue, P. Keating, C. Vicenik, Kristine M. Yu
{"title":"Voicesauce: A Program for Voice Analysis","authors":"Yen-Liang Shue, P. Keating, C. Vicenik, Kristine M. Yu","doi":"10.1121/1.3248865","DOIUrl":null,"url":null,"abstract":"VOICESAUCE is a new application, implemented in MATLAB, which provides automated voice measurements over time from audio recordings. The measures currently computed are F0, H1(*), H2(*), H4(*), H1(*)‐H2(*), H2(*)‐H4(*), H1(*)‐A1, H1(*)‐A2, H1(*)‐A3, energy, Cepstral Peak Prominence, F1–F4, and B1–B4, where (*) indicates that harmonic amplitudes are reported with and without corrections for formant frequencies and bandwidths [Iseli et al. (2006)]. Formant values are calculated using the Snack Sound Toolkit, while F0 is calculated using the STRAIGHT algorithm; harmonic spectra magnitudes are computed pitch‐synchronously. VOICESAUCE takes as input a folder of wav files, and for each input wav file produces a MATLAB file with values every millsecond for all measures. It can operate over the whole input file or over segments delimited by a PRAAT textgrid file. VOICESAUCE then takes these MATLAB outputs, optionally along with electroglottographic measurements obtained separately from PCQUIRERX, and provides con...","PeriodicalId":74531,"journal":{"name":"Proceedings of the ... International Congress of Phonetic Sciences. International Congress of Phonetic Sciences","volume":"137 1","pages":"1846-1849"},"PeriodicalIF":0.0000,"publicationDate":"2009-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"355","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ... International Congress of Phonetic Sciences. International Congress of Phonetic Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1121/1.3248865","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 355

Abstract

VOICESAUCE is a new application, implemented in MATLAB, which provides automated voice measurements over time from audio recordings. The measures currently computed are F0, H1(*), H2(*), H4(*), H1(*)‐H2(*), H2(*)‐H4(*), H1(*)‐A1, H1(*)‐A2, H1(*)‐A3, energy, Cepstral Peak Prominence, F1–F4, and B1–B4, where (*) indicates that harmonic amplitudes are reported with and without corrections for formant frequencies and bandwidths [Iseli et al. (2006)]. Formant values are calculated using the Snack Sound Toolkit, while F0 is calculated using the STRAIGHT algorithm; harmonic spectra magnitudes are computed pitch‐synchronously. VOICESAUCE takes as input a folder of wav files, and for each input wav file produces a MATLAB file with values every millsecond for all measures. It can operate over the whole input file or over segments delimited by a PRAAT textgrid file. VOICESAUCE then takes these MATLAB outputs, optionally along with electroglottographic measurements obtained separately from PCQUIRERX, and provides con...
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Voicesauce:语音分析程序
VOICESAUCE是一个在MATLAB中实现的新应用程序,它可以根据录音提供随时间的自动语音测量。目前计算的测量值为F0, H1(*), H2(*), H4(*), H1(*)‐H2(*), H2(*)‐H4(*), H1(*)‐A1, H1(*)‐A2, H1(*)‐A3,能量,倒频峰突出,F1-F4和B1-B4,其中(*)表示谐波幅度报告有或没有对形成峰频率和带宽进行修正[Iseli等人(2006)]。使用小吃声音工具包计算形成峰值,而使用STRAIGHT算法计算F0;谐波谱的幅度是同步计算的。VOICESAUCE将wav文件文件夹作为输入,并为每个输入wav文件生成一个MATLAB文件,其值每毫秒用于所有测量。它可以对整个输入文件或由PRAAT textgrid文件分隔的段进行操作。VOICESAUCE然后采取这些MATLAB输出,可选地与从PCQUIRERX单独获得的声门电测量一起,并提供…
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
DENTOFACIAL DISHARMONY PATIENTS' SIBILANTS DIFFER FROM CONTROLS' MORE IN SOURCE THAN FILTER PROPERTIES. The Perceptual Contribution of Consonants and Vowels to Sentence Recognition: Effect of Dialect Variation in American English. LISTENER PREFERENCE IS FOR REDUCED DETERMINERS THAT ANTICIPATE THE FOLLOWING NOUN. Computer-Assisted Syllable Complexity Analysis of Continuous Speech as a Measure of Child Speech Disorders. CHARACTERIZING THE COORDINATION OF SPEECH PRODUCTION AND BREATHING.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1