{"title":"Discriminant local information distance preserving projection for text-independent speaker recognition","authors":"Liang He, Jia Li","doi":"10.1109/ISCSLP.2012.6423466","DOIUrl":null,"url":null,"abstract":"A novel method is presented based on a statistical manifold for text-independent speaker recognition. After feature extraction, speaker recognition becomes a sequence classification problem. By discarding time information, the core task is the comparison of multiple sample sets. Each set is assumed to be governed by a probability density function (PDF). We estimate the PDFs and place the estimated statistical models on a statistical manifold. Fisher information distance is applied to compute distance between adjacent PDFs. Discriminant local preserving projection is used to push adjacent PDFs which belong to different classes apart to further improve the recognition accuracy. Experiments were carried out on the NIST SRE08 tel-tel database. Our presented method gave an excellent performance.","PeriodicalId":186099,"journal":{"name":"2012 8th International Symposium on Chinese Spoken Language Processing","volume":"482 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 8th International Symposium on Chinese Spoken Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCSLP.2012.6423466","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
A novel method is presented based on a statistical manifold for text-independent speaker recognition. After feature extraction, speaker recognition becomes a sequence classification problem. By discarding time information, the core task is the comparison of multiple sample sets. Each set is assumed to be governed by a probability density function (PDF). We estimate the PDFs and place the estimated statistical models on a statistical manifold. Fisher information distance is applied to compute distance between adjacent PDFs. Discriminant local preserving projection is used to push adjacent PDFs which belong to different classes apart to further improve the recognition accuracy. Experiments were carried out on the NIST SRE08 tel-tel database. Our presented method gave an excellent performance.