S. Chalamala, Balakrishna Gudla, B. Yegnanarayana, K. Sheela
{"title":"Improved lip contour extraction for visual speech recognition","authors":"S. Chalamala, Balakrishna Gudla, B. Yegnanarayana, K. Sheela","doi":"10.1109/ICCE.2015.7066486","DOIUrl":null,"url":null,"abstract":"Automatic speech recognition systems perform on acoustic speech signals and therefore they are unreliable in noisy environments. Visual speech features such as lip movements of a speaker can make the speech recognition system robust. To track the lip movements, lip contour extraction is a necessary step and plays a crucial role in the visual speech recognition. In this paper, we propose a new method for lip contour extraction using fuzzy clustering with elliptic shape information and active contour model. In this method, we combined both image and model based methods to improve the performance of lip contour extraction. Our proposed lip contour extraction method outperforms few of existing lip contour extraction methods. We applied our lip contour extraction method on 3600 lip images from VidTimit database and results are found better than the few existing lip contour extraction methods.","PeriodicalId":169402,"journal":{"name":"2015 IEEE International Conference on Consumer Electronics (ICCE)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE International Conference on Consumer Electronics (ICCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCE.2015.7066486","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
Automatic speech recognition systems perform on acoustic speech signals and therefore they are unreliable in noisy environments. Visual speech features such as lip movements of a speaker can make the speech recognition system robust. To track the lip movements, lip contour extraction is a necessary step and plays a crucial role in the visual speech recognition. In this paper, we propose a new method for lip contour extraction using fuzzy clustering with elliptic shape information and active contour model. In this method, we combined both image and model based methods to improve the performance of lip contour extraction. Our proposed lip contour extraction method outperforms few of existing lip contour extraction methods. We applied our lip contour extraction method on 3600 lip images from VidTimit database and results are found better than the few existing lip contour extraction methods.