Discriminant local information distance preserving projection for text-independent speaker recognition

2012 8th International Symposium on Chinese Spoken Language Processing Pub Date : 2012-12-01 DOI:10.1109/ISCSLP.2012.6423466

Liang He, Jia Li

引用次数: 1

Abstract

A novel method is presented based on a statistical manifold for text-independent speaker recognition. After feature extraction, speaker recognition becomes a sequence classification problem. By discarding time information, the core task is the comparison of multiple sample sets. Each set is assumed to be governed by a probability density function (PDF). We estimate the PDFs and place the estimated statistical models on a statistical manifold. Fisher information distance is applied to compute distance between adjacent PDFs. Discriminant local preserving projection is used to push adjacent PDFs which belong to different classes apart to further improve the recognition accuracy. Experiments were carried out on the NIST SRE08 tel-tel database. Our presented method gave an excellent performance.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

独立文本说话人识别的判别局部信息距离保持投影

提出了一种基于统计流形的独立文本说话人识别方法。经过特征提取，说话人识别成为一个序列分类问题。通过丢弃时间信息，核心任务是多个样本集的比较。假设每个集合都由概率密度函数(PDF)控制。我们估计pdf并将估计的统计模型放在统计流形上。Fisher信息距离用于计算相邻pdf之间的距离。采用判别局部保留投影将相邻的不同类别的pdf推开，进一步提高识别精度。实验在NIST的SRE08电话-电话数据库上进行。我们提出的方法取得了很好的效果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2012 8th International Symposium on Chinese Spoken Language Processing

自引率

0.00%

发文量