{"title":"Spoken Language Identification Using a New Sequence Kernel-based SVM Back-end Classifier","authors":"A. Ziaei, S. Ahadi, S. M. Mirrezaie, H. Yeganeh","doi":"10.1109/ISSPIT.2008.4775713","DOIUrl":null,"url":null,"abstract":"In this paper, we present a new back-end classifier for GMM-LM based language identification systems. Our new proposed system consists of two main parts, mapping matrix and bank of SVMs. These two parts are located in series after GMM-LM system. The mapping matrix, maps the language models' output vectors to a new space in which the languages are more separable than before. Then each SVM in the SVM bank separates one language from the others. We used a new sequence kernel for each SVM in the bank. We show that our new sequence kernel-based SVMs separate languages more efficiently than common Gaussian mixture and GLDS SVM back-end classifiers. Also our new mapping matrix outperforms common linear discriminant matrix in separating classes from each other. Using these two parts increases the LID accuracy noticeably in comparison with the other LDA-GMM and LDA-GLDS SVM back-end classifiers. Our experiments on 5 languages from OGI-TS multilanguage task, prove our claim.","PeriodicalId":213756,"journal":{"name":"2008 IEEE International Symposium on Signal Processing and Information Technology","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE International Symposium on Signal Processing and Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSPIT.2008.4775713","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
In this paper, we present a new back-end classifier for GMM-LM based language identification systems. Our new proposed system consists of two main parts, mapping matrix and bank of SVMs. These two parts are located in series after GMM-LM system. The mapping matrix, maps the language models' output vectors to a new space in which the languages are more separable than before. Then each SVM in the SVM bank separates one language from the others. We used a new sequence kernel for each SVM in the bank. We show that our new sequence kernel-based SVMs separate languages more efficiently than common Gaussian mixture and GLDS SVM back-end classifiers. Also our new mapping matrix outperforms common linear discriminant matrix in separating classes from each other. Using these two parts increases the LID accuracy noticeably in comparison with the other LDA-GMM and LDA-GLDS SVM back-end classifiers. Our experiments on 5 languages from OGI-TS multilanguage task, prove our claim.