{"title":"Lip detection and tracking","authors":"A. Caplier","doi":"10.1109/ICIAP.2001.956978","DOIUrl":null,"url":null,"abstract":"Seeing the talker's lips in addition to audition can improve speech understanding which is rather based on lip shape temporal evolution than on absolute mouth shape. We propose a totally automatic algorithm which can extract lip shape over an image sequence. The algorithm does not require any make-up or markers and works under natural lighting conditions. The lip detection algorithm uses an active shape model to describe the mouth. After a training step, the mouth model is iteratively deformed under constraints according to spatiotemporal energies. The robust prior detection of mouth corners and Cupidon's arch yields the automatic positioning of the initial shape which is very difficult and must be as accurate as possible. Temporal information integration comes from the definition of Kalman filters on the independent mouth parameters. Such filtering gives an initial shape close to the final one which speeds up the convergence rate. We point out on the behaviour of our algorithm when a transition open mouth/closed mouth or closed mouth/open mouth occurs.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"34","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 11th International Conference on Image Analysis and Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIAP.2001.956978","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 34
Abstract
Seeing the talker's lips in addition to audition can improve speech understanding which is rather based on lip shape temporal evolution than on absolute mouth shape. We propose a totally automatic algorithm which can extract lip shape over an image sequence. The algorithm does not require any make-up or markers and works under natural lighting conditions. The lip detection algorithm uses an active shape model to describe the mouth. After a training step, the mouth model is iteratively deformed under constraints according to spatiotemporal energies. The robust prior detection of mouth corners and Cupidon's arch yields the automatic positioning of the initial shape which is very difficult and must be as accurate as possible. Temporal information integration comes from the definition of Kalman filters on the independent mouth parameters. Such filtering gives an initial shape close to the final one which speeds up the convergence rate. We point out on the behaviour of our algorithm when a transition open mouth/closed mouth or closed mouth/open mouth occurs.