{"title":"电声门仪作为孤立词识别的附加信息来源","authors":"P. Dikshit, R. W. Schubert","doi":"10.1109/SBEC.1995.514413","DOIUrl":null,"url":null,"abstract":"Traditionally, speech recognition systems use only the acoustic speech signal (speech). However, the source of the signal and the way speech is produced and whether this information can aid in speech recognition needs to be investigated. The objective of this study was to assess the contribution of using the electroglottograph (EGG) as an additional source of information along with speech in an isolated word recognition system. The vocabulary consisted of 64 words, ranging from mono-syllabic words to words with four syllables. Two fully connected artificial neural networks were designed. One network (speech network) used only speech as its source of information. The other network (speech+EGG network) used EGG along with the acoustic speech signal as its source of information. The speech network had a peak recognition rate of 94.37%. The speech+EGG network had a peak recognition rate of 99.37%. Hence, the information provided by the EGG improved the performance of the speech recognition system by 5%.","PeriodicalId":332563,"journal":{"name":"Proceedings of the 1995 Fourteenth Southern Biomedical Engineering Conference","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1995-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Electroglottograph as an additional source of information in isolated word recognition\",\"authors\":\"P. Dikshit, R. W. Schubert\",\"doi\":\"10.1109/SBEC.1995.514413\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Traditionally, speech recognition systems use only the acoustic speech signal (speech). However, the source of the signal and the way speech is produced and whether this information can aid in speech recognition needs to be investigated. The objective of this study was to assess the contribution of using the electroglottograph (EGG) as an additional source of information along with speech in an isolated word recognition system. The vocabulary consisted of 64 words, ranging from mono-syllabic words to words with four syllables. Two fully connected artificial neural networks were designed. One network (speech network) used only speech as its source of information. The other network (speech+EGG network) used EGG along with the acoustic speech signal as its source of information. The speech network had a peak recognition rate of 94.37%. The speech+EGG network had a peak recognition rate of 99.37%. Hence, the information provided by the EGG improved the performance of the speech recognition system by 5%.\",\"PeriodicalId\":332563,\"journal\":{\"name\":\"Proceedings of the 1995 Fourteenth Southern Biomedical Engineering Conference\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1995-04-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 1995 Fourteenth Southern Biomedical Engineering Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SBEC.1995.514413\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 1995 Fourteenth Southern Biomedical Engineering Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SBEC.1995.514413","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Electroglottograph as an additional source of information in isolated word recognition
Traditionally, speech recognition systems use only the acoustic speech signal (speech). However, the source of the signal and the way speech is produced and whether this information can aid in speech recognition needs to be investigated. The objective of this study was to assess the contribution of using the electroglottograph (EGG) as an additional source of information along with speech in an isolated word recognition system. The vocabulary consisted of 64 words, ranging from mono-syllabic words to words with four syllables. Two fully connected artificial neural networks were designed. One network (speech network) used only speech as its source of information. The other network (speech+EGG network) used EGG along with the acoustic speech signal as its source of information. The speech network had a peak recognition rate of 94.37%. The speech+EGG network had a peak recognition rate of 99.37%. Hence, the information provided by the EGG improved the performance of the speech recognition system by 5%.