Automatic Extraction of Geometric Lip Features with Application to Multi-Modal Speaker Identification

2006 IEEE International Conference on Multimedia and Expo. Pub Date: 2006-07-09. DOI: 10.1109/ICME.2006.262594

Ivana Arsic, Roger Vilagut, J. Thiran

Citations: 13

Abstract

In this paper, we consider the problem of automatically extracting geometric lip features for multi-modal speaker identification. Visual information from the mouth region can be of great importance for improving speaker identification performance in noisy conditions. We propose a novel method for automated lip feature extraction that uses a color space transformation and a fuzzy c-means clustering technique. Using the obtained visual cues, closed-set audio-visual speaker identification experiments are performed on the CUAVE database, showing promising results.
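The abstract names the two ingredients of the lip extraction step (a color space transformation followed by fuzzy c-means clustering) but gives no details. The sketch below is only an illustration of how such a pipeline might look, not the authors' method. The pseudo-hue feature R / (R + G), the choice of two clusters, the fuzzifier m = 2, and the function names `pseudo_hue`, `fuzzy_c_means`, and `lip_mask` are all assumptions introduced here.

```python
def pseudo_hue(r, g, b):
    """Assumed color transform (not from the paper): R / (R + G).
    Lips tend to score higher than skin because they carry less green."""
    return r / (r + g) if (r + g) > 0 else 0.0


def _memberships(values, centers, m):
    """Standard fuzzy c-means membership update for scalar data."""
    exponent = 2.0 / (m - 1.0)
    rows = []
    for x in values:
        dists = [abs(x - v) for v in centers]
        if any(d == 0.0 for d in dists):
            # Point coincides with one or more centers: share membership among them.
            hits = [1.0 if d == 0.0 else 0.0 for d in dists]
            total = sum(hits)
            rows.append([h / total for h in hits])
        else:
            rows.append([
                1.0 / sum((dists[k] / dists[j]) ** exponent
                          for j in range(len(centers)))
                for k in range(len(centers))
            ])
    return rows


def fuzzy_c_means(values, c=2, m=2.0, iters=100, tol=1e-9):
    """Cluster scalar values into c fuzzy clusters; returns (centers, memberships)."""
    lo, hi = min(values), max(values)
    # Deterministic initialisation: centers spread evenly over the data range.
    centers = [lo + (hi - lo) * (k + 0.5) / c for k in range(c)]
    for _ in range(iters):
        u = _memberships(values, centers, m)
        new_centers = []
        for k in range(c):
            weights = [row[k] ** m for row in u]
            total = sum(weights)
            new_centers.append(
                sum(w * x for w, x in zip(weights, values)) / total
                if total > 0 else centers[k]
            )
        shift = max(abs(a - b) for a, b in zip(centers, new_centers))
        centers = new_centers
        if shift < tol:
            break
    # Memberships are recomputed so they match the returned centers.
    return centers, _memberships(values, centers, m)


def lip_mask(pixels, m=2.0):
    """Label each (r, g, b) pixel as lip (True) or non-lip (False).
    Assumption: the cluster with the higher pseudo-hue center is the lip."""
    values = [pseudo_hue(*p) for p in pixels]
    centers, u = fuzzy_c_means(values, c=2, m=m)
    lip = max(range(2), key=lambda k: centers[k])
    return [row[lip] > 0.5 for row in u]


# Hypothetical sample: three skin-like pixels followed by three lip-like pixels.
pixels = [(200, 150, 130), (205, 155, 135), (198, 148, 128),
          (180, 80, 90), (175, 78, 88), (185, 82, 92)]
mask = lip_mask(pixels)
```

In a pipeline of this kind, geometric features such as mouth width and height would then be measured on the resulting binary lip region. How the paper derives its geometric features is not stated in the abstract.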

