Analysis of Speech Signals Using Excitation Source Information

Shreya R. Garipalli, B. Sathe-Pathak, A. Panat
{"title":"Analysis of Speech Signals Using Excitation Source Information","authors":"Shreya R. Garipalli, B. Sathe-Pathak, A. Panat","doi":"10.1109/ICMETE.2016.12","DOIUrl":null,"url":null,"abstract":"Speech is output of the time varying vocal tractsystem excited with the time varying excitation. Speech isproduced due to the impulse like excitation in each glottal cycle. During the production of speech, the instant of significantexcitation of the vocal tract system is referred to as epoch. In caseof voiced speech, most significant excitation takes place at theinstants of glottal closure i.e. glottal closure instants can bereferred as instants of significant excitation. Speech laugh is asignal produced when laughter occurs with neutral speech. Thespeech-laugh signal occurs frequently in natural conversationwith people. The features of speech-laugh, laughter and singingvoice deviates from the features of neutral speech. In this paper, we discriminate laughter, speech-laugh and neutral speech anddiscriminate singing voice and speech by obtaining epochlocations and extracting new features from these epochs. Themethod used here for the extraction of epochs is the ModifiedZero Frequency Filtering method. The features extracted fromepochs for the discrimination are fundamental frequency(f0) andslope of f0(α) at epoch locations and number of epochs (k) andstrength of excitation (β).","PeriodicalId":167368,"journal":{"name":"2016 International Conference on Micro-Electronics and Telecommunication Engineering (ICMETE)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Micro-Electronics and Telecommunication Engineering (ICMETE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMETE.2016.12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Speech is output of the time varying vocal tractsystem excited with the time varying excitation. Speech isproduced due to the impulse like excitation in each glottal cycle. During the production of speech, the instant of significantexcitation of the vocal tract system is referred to as epoch. In caseof voiced speech, most significant excitation takes place at theinstants of glottal closure i.e. glottal closure instants can bereferred as instants of significant excitation. Speech laugh is asignal produced when laughter occurs with neutral speech. Thespeech-laugh signal occurs frequently in natural conversationwith people. The features of speech-laugh, laughter and singingvoice deviates from the features of neutral speech. In this paper, we discriminate laughter, speech-laugh and neutral speech anddiscriminate singing voice and speech by obtaining epochlocations and extracting new features from these epochs. Themethod used here for the extraction of epochs is the ModifiedZero Frequency Filtering method. The features extracted fromepochs for the discrimination are fundamental frequency(f0) andslope of f0(α) at epoch locations and number of epochs (k) andstrength of excitation (β).
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于激励源信息的语音信号分析
语音是时变声道系统在时变激励作用下的输出。语音的产生是由于每个声门周期的脉冲兴奋。在语音产生过程中,声道系统显著兴奋的瞬间被称为epoch。在发音的情况下,最显著的兴奋发生在声门关闭的瞬间,即声门关闭的瞬间可以称为显著兴奋的瞬间。言语笑是笑与中性言语相结合时产生的一种信号。在与人的自然对话中,言语笑信号经常出现。言语特征——笑、笑、唱——偏离了中性言语的特征。本文通过获取时代定位并从中提取新的特征,对笑声、语笑和中性语音进行了区分,并对歌声和语音进行了区分。这里用于提取历元的方法是修改的零频率滤波方法。从历元中提取的特征为历元位置的基频(f0)和斜率(α)、历元数(k)和激励强度(β)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Study of E-shaped Patch Antenna with Two Rectangular Slots Text Summarization of Hindi Documents Using Rule Based Approach Estimation of Respiratory Rate from the ECG Using Instantaneous Frequency Tracking FxLMS Algorithm Low Power and High Performance Ring Counter Using Pulsed Latch Technique Satellite Image Enhancement using Discrete Wavelet Transform, Singular Value Decomposition and its Noise Performance Analysis
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1