唇形检测与跟踪

Proceedings 11th International Conference on Image Analysis and Processing Pub Date : 2001-09-26 DOI:10.1109/ICIAP.2001.956978

A. Caplier

{"title":"唇形检测与跟踪","authors":"A. Caplier","doi":"10.1109/ICIAP.2001.956978","DOIUrl":null,"url":null,"abstract":"Seeing the talker's lips in addition to audition can improve speech understanding which is rather based on lip shape temporal evolution than on absolute mouth shape. We propose a totally automatic algorithm which can extract lip shape over an image sequence. The algorithm does not require any make-up or markers and works under natural lighting conditions. The lip detection algorithm uses an active shape model to describe the mouth. After a training step, the mouth model is iteratively deformed under constraints according to spatiotemporal energies. The robust prior detection of mouth corners and Cupidon's arch yields the automatic positioning of the initial shape which is very difficult and must be as accurate as possible. Temporal information integration comes from the definition of Kalman filters on the independent mouth parameters. Such filtering gives an initial shape close to the final one which speeds up the convergence rate. We point out on the behaviour of our algorithm when a transition open mouth/closed mouth or closed mouth/open mouth occurs.","PeriodicalId":365627,"journal":{"name":"Proceedings 11th International Conference on Image Analysis and Processing","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"34","resultStr":"{\"title\":\"Lip detection and tracking\",\"authors\":\"A. Caplier\",\"doi\":\"10.1109/ICIAP.2001.956978\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Seeing the talker's lips in addition to audition can improve speech understanding which is rather based on lip shape temporal evolution than on absolute mouth shape. We propose a totally automatic algorithm which can extract lip shape over an image sequence. The algorithm does not require any make-up or markers and works under natural lighting conditions. The lip detection algorithm uses an active shape model to describe the mouth. After a training step, the mouth model is iteratively deformed under constraints according to spatiotemporal energies. The robust prior detection of mouth corners and Cupidon's arch yields the automatic positioning of the initial shape which is very difficult and must be as accurate as possible. Temporal information integration comes from the definition of Kalman filters on the independent mouth parameters. Such filtering gives an initial shape close to the final one which speeds up the convergence rate. We point out on the behaviour of our algorithm when a transition open mouth/closed mouth or closed mouth/open mouth occurs.\",\"PeriodicalId\":365627,\"journal\":{\"name\":\"Proceedings 11th International Conference on Image Analysis and Processing\",\"volume\":\"47 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2001-09-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"34\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings 11th International Conference on Image Analysis and Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIAP.2001.956978\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 11th International Conference on Image Analysis and Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIAP.2001.956978","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 34

摘要

在听音的基础上看说话人的嘴唇可以提高对言语的理解，这是基于唇形的时间演变，而不是绝对的嘴型。提出了一种完全自动的唇形提取算法。该算法不需要任何化妆或标记，并在自然光条件下工作。唇形检测算法采用主动形状模型来描述口腔。经过一个训练步骤，嘴巴模型在时空能量约束下迭代变形。对嘴角和丘比顿弧度的鲁棒先验检测产生了初始形状的自动定位，这是非常困难的，必须尽可能准确。时间信息集成来自于对独立口参数的卡尔曼滤波的定义。这种滤波使初始形状接近最终形状，从而加快了收敛速度。我们指出了我们的算法在张嘴/闭口或闭口/张嘴发生转换时的行为。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Lip detection and tracking

Seeing the talker's lips in addition to audition can improve speech understanding which is rather based on lip shape temporal evolution than on absolute mouth shape. We propose a totally automatic algorithm which can extract lip shape over an image sequence. The algorithm does not require any make-up or markers and works under natural lighting conditions. The lip detection algorithm uses an active shape model to describe the mouth. After a training step, the mouth model is iteratively deformed under constraints according to spatiotemporal energies. The robust prior detection of mouth corners and Cupidon's arch yields the automatic positioning of the initial shape which is very difficult and must be as accurate as possible. Temporal information integration comes from the definition of Kalman filters on the independent mouth parameters. Such filtering gives an initial shape close to the final one which speeds up the convergence rate. We point out on the behaviour of our algorithm when a transition open mouth/closed mouth or closed mouth/open mouth occurs.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings 11th International Conference on Image Analysis and Processing

自引率

0.00%

发文量