{"title":"Automatic Detection of Mispronounced Lyrics in Singing","authors":"Wei-Ho Tsai, Van-Thuan Tran, Shiang-Shiun Kung","doi":"10.1109/ICMLC48188.2019.8949315","DOIUrl":null,"url":null,"abstract":"In this study, we propose an automatic system for detecting mispronounced lyrics in singing, thereby providing information for singing performance assessment. The system is built upon the basis of speech utterance verification and further improved by considering the difference between singing and speech. We recognize that the vowels are often lengthened during singing and thus include a duration modeling concept in the acoustic modeling to absorb the variation of the length of a vowel in singing. Our experiments show that the proposed methods can achieve 11.3% equal error rate in detecting the mispronounced lyrics in singing.","PeriodicalId":221349,"journal":{"name":"2019 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Machine Learning and Cybernetics (ICMLC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLC48188.2019.8949315","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this study, we propose an automatic system for detecting mispronounced lyrics in singing, thereby providing information for singing performance assessment. The system is built upon the basis of speech utterance verification and further improved by considering the difference between singing and speech. We recognize that the vowels are often lengthened during singing and thus include a duration modeling concept in the acoustic modeling to absorb the variation of the length of a vowel in singing. Our experiments show that the proposed methods can achieve 11.3% equal error rate in detecting the mispronounced lyrics in singing.