{"title":"Singing characterization using temporal and spectral features in Indian musical notes","authors":"Shivam Sharma, V. K. Mittal","doi":"10.1109/ICSPCOM.2016.7980604","DOIUrl":null,"url":null,"abstract":"Pitch extraction from a multi pitched music signal significantly relies on the training data for tasks like enhanced music-voice separation. This paper aims at identifying characteristic temporal and spectral features, using speech processing techniques that can help obtain crucial information, leading to a better understanding of the music structure. Towards this goal, the F0 contour has been studied to capture the melodic trends in a Sargam progression, and the results have been compared with the output of state of the art package PRAAT. Effects of pre-emphasising in enhancing the tracking are also discussed. A method is proposed through which the transition trends in the note progression can be validated for correctness and the results are encouraging in characterising the progression. Spectral analysis is done to get some insight into the harmonic behaviour in conjunction with the signal energy pattern. This is followed by the LP Analysis that tells about the Swara Pronunciation. The results observed indicate usefulness of the standard techniques and the constraints posed towards the singing voice analysis.","PeriodicalId":213713,"journal":{"name":"2016 International Conference on Signal Processing and Communication (ICSC)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Signal Processing and Communication (ICSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSPCOM.2016.7980604","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Pitch extraction from a multi pitched music signal significantly relies on the training data for tasks like enhanced music-voice separation. This paper aims at identifying characteristic temporal and spectral features, using speech processing techniques that can help obtain crucial information, leading to a better understanding of the music structure. Towards this goal, the F0 contour has been studied to capture the melodic trends in a Sargam progression, and the results have been compared with the output of state of the art package PRAAT. Effects of pre-emphasising in enhancing the tracking are also discussed. A method is proposed through which the transition trends in the note progression can be validated for correctness and the results are encouraging in characterising the progression. Spectral analysis is done to get some insight into the harmonic behaviour in conjunction with the signal energy pattern. This is followed by the LP Analysis that tells about the Swara Pronunciation. The results observed indicate usefulness of the standard techniques and the constraints posed towards the singing voice analysis.