{"title":"Modeling of linguistic and acoustic information from speech signal for multilingual spoken language identification system (SLID)","authors":"S. Bansal, S. Agrawal","doi":"10.1109/ICSDA.2017.8384468","DOIUrl":null,"url":null,"abstract":"Spoken language identification is the task of identifying a language from the given speech signal. Efforts to develop language identification systems for Indian languages have been very limited due to the problem of speaker availability and language legibility but the requirement of SLID is increasing for civil and defense applications day by day. The present paper reports a study to develop a multilingual identification system for two Indian languages i.e. Hindi and Manipuri by using PPRLM approach that requires phoneme based labeled speech corpus for each language. For each language, data set of 300 phonetically rich sentences spoken by 25 native speakers (15000 utterances) were recorded, analyzed and annotated phonemically to make trigram based phonotactic model. The features of the speech signal have been extracted using MFCCs and GMM was used as a classifier. Results show that accuracy increases with the increase of Gaussians and also with the training samples.","PeriodicalId":255147,"journal":{"name":"2017 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment (O-COCOSDA)","volume":"187 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment (O-COCOSDA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSDA.2017.8384468","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Spoken language identification is the task of identifying a language from the given speech signal. Efforts to develop language identification systems for Indian languages have been very limited due to the problem of speaker availability and language legibility but the requirement of SLID is increasing for civil and defense applications day by day. The present paper reports a study to develop a multilingual identification system for two Indian languages i.e. Hindi and Manipuri by using PPRLM approach that requires phoneme based labeled speech corpus for each language. For each language, data set of 300 phonetically rich sentences spoken by 25 native speakers (15000 utterances) were recorded, analyzed and annotated phonemically to make trigram based phonotactic model. The features of the speech signal have been extracted using MFCCs and GMM was used as a classifier. Results show that accuracy increases with the increase of Gaussians and also with the training samples.