{"title":"基于傅立叶分解方法的语音瞬时基频估计","authors":"Pushpendra Singh, Amit Singhal, Binish Fatimah","doi":"10.1109/SPCOM55316.2022.9840851","DOIUrl":null,"url":null,"abstract":"Speech analysis and various speech processing applications use instantaneous fundamental frequency $(F_{0})$ of voiced speech signal as a prime acoustic parameter. The low frequency component of the voiced speech possesses most of the energy in F0 neighbourhood and its few harmonics. In this study, a novel approach is proposed to extract the instantaneous F0 component from a voiced speech signal using Fourier decomposition method (FDM), which decomposes the signal into its amplitude-frequency modulated (AM-FM) components. We also demonstrate that these derived AM-FM components, obtained due to desired frequency band decomposition property of FDM, provides the most suitable representation for voiced speech as compared to other AM-FM models available in the literature. Numerical results are presented to validate the adequacy of proposed method in estimating F0, when compared with existing algorithms based on empirical mode decomposition (EMD) and other speech-related algorithms.","PeriodicalId":246982,"journal":{"name":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Instantaneous Fundamental Frequency Estimation from Speech using Fourier Decomposition Method\",\"authors\":\"Pushpendra Singh, Amit Singhal, Binish Fatimah\",\"doi\":\"10.1109/SPCOM55316.2022.9840851\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speech analysis and various speech processing applications use instantaneous fundamental frequency $(F_{0})$ of voiced speech signal as a prime acoustic parameter. The low frequency component of the voiced speech possesses most of the energy in F0 neighbourhood and its few harmonics. In this study, a novel approach is proposed to extract the instantaneous F0 component from a voiced speech signal using Fourier decomposition method (FDM), which decomposes the signal into its amplitude-frequency modulated (AM-FM) components. We also demonstrate that these derived AM-FM components, obtained due to desired frequency band decomposition property of FDM, provides the most suitable representation for voiced speech as compared to other AM-FM models available in the literature. Numerical results are presented to validate the adequacy of proposed method in estimating F0, when compared with existing algorithms based on empirical mode decomposition (EMD) and other speech-related algorithms.\",\"PeriodicalId\":246982,\"journal\":{\"name\":\"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)\",\"volume\":\"80 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SPCOM55316.2022.9840851\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Signal Processing and Communications (SPCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPCOM55316.2022.9840851","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Instantaneous Fundamental Frequency Estimation from Speech using Fourier Decomposition Method
Speech analysis and various speech processing applications use instantaneous fundamental frequency $(F_{0})$ of voiced speech signal as a prime acoustic parameter. The low frequency component of the voiced speech possesses most of the energy in F0 neighbourhood and its few harmonics. In this study, a novel approach is proposed to extract the instantaneous F0 component from a voiced speech signal using Fourier decomposition method (FDM), which decomposes the signal into its amplitude-frequency modulated (AM-FM) components. We also demonstrate that these derived AM-FM components, obtained due to desired frequency band decomposition property of FDM, provides the most suitable representation for voiced speech as compared to other AM-FM models available in the literature. Numerical results are presented to validate the adequacy of proposed method in estimating F0, when compared with existing algorithms based on empirical mode decomposition (EMD) and other speech-related algorithms.