{"title":"Analysis of Speech Signals Using Excitation Source Information","authors":"Shreya R. Garipalli, B. Sathe-Pathak, A. Panat","doi":"10.1109/ICMETE.2016.12","DOIUrl":null,"url":null,"abstract":"Speech is output of the time varying vocal tractsystem excited with the time varying excitation. Speech isproduced due to the impulse like excitation in each glottal cycle. During the production of speech, the instant of significantexcitation of the vocal tract system is referred to as epoch. In caseof voiced speech, most significant excitation takes place at theinstants of glottal closure i.e. glottal closure instants can bereferred as instants of significant excitation. Speech laugh is asignal produced when laughter occurs with neutral speech. Thespeech-laugh signal occurs frequently in natural conversationwith people. The features of speech-laugh, laughter and singingvoice deviates from the features of neutral speech. In this paper, we discriminate laughter, speech-laugh and neutral speech anddiscriminate singing voice and speech by obtaining epochlocations and extracting new features from these epochs. Themethod used here for the extraction of epochs is the ModifiedZero Frequency Filtering method. The features extracted fromepochs for the discrimination are fundamental frequency(f0) andslope of f0(α) at epoch locations and number of epochs (k) andstrength of excitation (β).","PeriodicalId":167368,"journal":{"name":"2016 International Conference on Micro-Electronics and Telecommunication Engineering (ICMETE)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Micro-Electronics and Telecommunication Engineering (ICMETE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMETE.2016.12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Speech is output of the time varying vocal tractsystem excited with the time varying excitation. Speech isproduced due to the impulse like excitation in each glottal cycle. During the production of speech, the instant of significantexcitation of the vocal tract system is referred to as epoch. In caseof voiced speech, most significant excitation takes place at theinstants of glottal closure i.e. glottal closure instants can bereferred as instants of significant excitation. Speech laugh is asignal produced when laughter occurs with neutral speech. Thespeech-laugh signal occurs frequently in natural conversationwith people. The features of speech-laugh, laughter and singingvoice deviates from the features of neutral speech. In this paper, we discriminate laughter, speech-laugh and neutral speech anddiscriminate singing voice and speech by obtaining epochlocations and extracting new features from these epochs. Themethod used here for the extraction of epochs is the ModifiedZero Frequency Filtering method. The features extracted fromepochs for the discrimination are fundamental frequency(f0) andslope of f0(α) at epoch locations and number of epochs (k) andstrength of excitation (β).