{"title":"基于GMM超向量和人工神经网络的印度口语分类","authors":"A. Bakshi, S. Kopparapu","doi":"10.1109/IBSSC47189.2019.8972979","DOIUrl":null,"url":null,"abstract":"Indian languages are phonetic in nature; phonetics is branch of linguistics which studies the structure of human language sound. Acoustic phonetic features associated with languages play an important role in spoken language identification. In this paper, Gaussian Mixture Model supervectors is used to capture acoustic phonetic variation in Indian languages. Mel frequency cepstral coefficient (MFCC) with delta coefficients is used to represent the language specific acoustic phonetic information of speech and artificial neural network ANN is used as a classifier for language identification. In the present work, we have conducted extensive experiments for three different datasets created from the news broadcast in different Indian languages from All India Radio. The performance of ANN classifier using GMM supervectors is evaluated on these three datasets.","PeriodicalId":148941,"journal":{"name":"2019 IEEE Bombay Section Signature Conference (IBSSC)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Spoken Indian Language Classification using GMM supervectors and Artificial Neural Networks\",\"authors\":\"A. Bakshi, S. Kopparapu\",\"doi\":\"10.1109/IBSSC47189.2019.8972979\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Indian languages are phonetic in nature; phonetics is branch of linguistics which studies the structure of human language sound. Acoustic phonetic features associated with languages play an important role in spoken language identification. In this paper, Gaussian Mixture Model supervectors is used to capture acoustic phonetic variation in Indian languages. Mel frequency cepstral coefficient (MFCC) with delta coefficients is used to represent the language specific acoustic phonetic information of speech and artificial neural network ANN is used as a classifier for language identification. In the present work, we have conducted extensive experiments for three different datasets created from the news broadcast in different Indian languages from All India Radio. The performance of ANN classifier using GMM supervectors is evaluated on these three datasets.\",\"PeriodicalId\":148941,\"journal\":{\"name\":\"2019 IEEE Bombay Section Signature Conference (IBSSC)\",\"volume\":\"46 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE Bombay Section Signature Conference (IBSSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IBSSC47189.2019.8972979\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE Bombay Section Signature Conference (IBSSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IBSSC47189.2019.8972979","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Spoken Indian Language Classification using GMM supervectors and Artificial Neural Networks
Indian languages are phonetic in nature; phonetics is branch of linguistics which studies the structure of human language sound. Acoustic phonetic features associated with languages play an important role in spoken language identification. In this paper, Gaussian Mixture Model supervectors is used to capture acoustic phonetic variation in Indian languages. Mel frequency cepstral coefficient (MFCC) with delta coefficients is used to represent the language specific acoustic phonetic information of speech and artificial neural network ANN is used as a classifier for language identification. In the present work, we have conducted extensive experiments for three different datasets created from the news broadcast in different Indian languages from All India Radio. The performance of ANN classifier using GMM supervectors is evaluated on these three datasets.