{"title":"Accent Identification of Ethnically Diverse Nigerian English Speakers","authors":"Francisca O. Oladipo, Rahmon A Habeeb, A. Musa","doi":"10.2139/ssrn.3666815","DOIUrl":null,"url":null,"abstract":"It is imperative to improve the speech recognition system as human-machine interfaces are advancing in the growing global market of technologies. There are quite a number of Nigerian English speakers’ accents to which the speech recognition systems are not sufficiently exposed. Accents may suggest a lot of information about someone’s whereabouts, for example, their native language, place of origin, or ethnic groups and accent classification. Given the importance of accents, efficiency and accuracy of speech recognition systems can be improved with training data of diverse accents. This research provides support for accent-dependent automatic speech recognition by deploying a supervised learning algorithm to the task of recognizing three Nigerian ethnic groups (Yoruba, Igbo, and Hausa) and distinguish them based on their accents by constructing sequential Mel-Frequency Cepstral Coefficients (MFCC) features from the frames of the audio sample. Our results show that concatenating the MFCC features sequentially and applying a supervised learning technique to provide a solution to the problem of identifying and classifying accents works efficiently and accurately.","PeriodicalId":102139,"journal":{"name":"Other Topics Engineering Research eJournal","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Other Topics Engineering Research eJournal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3666815","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
It is imperative to improve the speech recognition system as human-machine interfaces are advancing in the growing global market of technologies. There are quite a number of Nigerian English speakers’ accents to which the speech recognition systems are not sufficiently exposed. Accents may suggest a lot of information about someone’s whereabouts, for example, their native language, place of origin, or ethnic groups and accent classification. Given the importance of accents, efficiency and accuracy of speech recognition systems can be improved with training data of diverse accents. This research provides support for accent-dependent automatic speech recognition by deploying a supervised learning algorithm to the task of recognizing three Nigerian ethnic groups (Yoruba, Igbo, and Hausa) and distinguish them based on their accents by constructing sequential Mel-Frequency Cepstral Coefficients (MFCC) features from the frames of the audio sample. Our results show that concatenating the MFCC features sequentially and applying a supervised learning technique to provide a solution to the problem of identifying and classifying accents works efficiently and accurately.