Khaled Lounnas, H. Satori, H. Teffahi, Mourad Abbas, Mohamed Lichouri
{"title":"CLIASR:一种组合的自动语音识别和语言识别系统","authors":"Khaled Lounnas, H. Satori, H. Teffahi, Mourad Abbas, Mohamed Lichouri","doi":"10.1109/IRASET48871.2020.9092020","DOIUrl":null,"url":null,"abstract":"In this paper, we aim to describe a novel technique which is a Combined Automatic Speech Recognition and Language Identification System that uses both ASR and LI technologies which consists of the recognition of spoken digits after identifying their language. An in-house corpus was used mainly for both speech-based multi-lingual identification and speech recognition tasks made of bilingual digits sounds that contains ten digits spoken mainly in two languages: Modern Standard Arabic (MSA) and Amazigh Moroccan dialect. First of all, we develop the Language Identification stage which is the basis of our hybrid system which behaves as front-end of our system that serves for spoken language detection. This facilitates the task of recognition by allocating the output to the appropriate Hidden Model Markov (HMM) based recognition system (Arabic or Amazigh) which improves recognition of a bilingual spoken digit efficiently. For this purpose, a set of parameters were adjusted on our CLIASR system to achieve good results, including classifier parameters, feature vector, and HMM-GMM parameters. The results show that our proposed LI-ASR system performs 33% better than an ordinary ASR for a given bilingual mixed speech corpus.","PeriodicalId":271840,"journal":{"name":"2020 1st International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"CLIASR: A Combined Automatic Speech Recognition and Language Identification System\",\"authors\":\"Khaled Lounnas, H. Satori, H. Teffahi, Mourad Abbas, Mohamed Lichouri\",\"doi\":\"10.1109/IRASET48871.2020.9092020\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we aim to describe a novel technique which is a Combined Automatic Speech Recognition and Language Identification System that uses both ASR and LI technologies which consists of the recognition of spoken digits after identifying their language. An in-house corpus was used mainly for both speech-based multi-lingual identification and speech recognition tasks made of bilingual digits sounds that contains ten digits spoken mainly in two languages: Modern Standard Arabic (MSA) and Amazigh Moroccan dialect. First of all, we develop the Language Identification stage which is the basis of our hybrid system which behaves as front-end of our system that serves for spoken language detection. This facilitates the task of recognition by allocating the output to the appropriate Hidden Model Markov (HMM) based recognition system (Arabic or Amazigh) which improves recognition of a bilingual spoken digit efficiently. For this purpose, a set of parameters were adjusted on our CLIASR system to achieve good results, including classifier parameters, feature vector, and HMM-GMM parameters. The results show that our proposed LI-ASR system performs 33% better than an ordinary ASR for a given bilingual mixed speech corpus.\",\"PeriodicalId\":271840,\"journal\":{\"name\":\"2020 1st International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET)\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 1st International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IRASET48871.2020.9092020\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 1st International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRASET48871.2020.9092020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
CLIASR: A Combined Automatic Speech Recognition and Language Identification System
In this paper, we aim to describe a novel technique which is a Combined Automatic Speech Recognition and Language Identification System that uses both ASR and LI technologies which consists of the recognition of spoken digits after identifying their language. An in-house corpus was used mainly for both speech-based multi-lingual identification and speech recognition tasks made of bilingual digits sounds that contains ten digits spoken mainly in two languages: Modern Standard Arabic (MSA) and Amazigh Moroccan dialect. First of all, we develop the Language Identification stage which is the basis of our hybrid system which behaves as front-end of our system that serves for spoken language detection. This facilitates the task of recognition by allocating the output to the appropriate Hidden Model Markov (HMM) based recognition system (Arabic or Amazigh) which improves recognition of a bilingual spoken digit efficiently. For this purpose, a set of parameters were adjusted on our CLIASR system to achieve good results, including classifier parameters, feature vector, and HMM-GMM parameters. The results show that our proposed LI-ASR system performs 33% better than an ordinary ASR for a given bilingual mixed speech corpus.