{"title":"基于Delta神经网络学习规则的氡-离散余弦变换语音识别","authors":"Osama Q. Al-Thahab","doi":"10.1109/ISFEE.2016.7803208","DOIUrl":null,"url":null,"abstract":"The recognition of seven recorded words in this paper is proposed by using a Radon and Discrete Cosine Transforms. The Radon Transform is used to reorder the data input with a new shape so that each voice maintain the same number of samples approximately, while the second frequency transform is used to minimize the dimensions of each audio signal to a small number of samples. The goal of this method is to rise the number of recognized audio signals and consequently increasing the database. The learning rule of Delta Neural Network is used for recognition with the assistance of multi neurons of a single layer such that the number of audio signals (recorded words) are coincide the number of neurons. The results will be compared based on the learning speed. The proposed system also examined by a test audio signal. Here, seven different words are recorded. Eight different persons (men and women) recorded these different words, so that there are 56 audio signal. Each eight signals belongs to the selfsame word; consequently, the outputs of these eight audio signals are the same.","PeriodicalId":240170,"journal":{"name":"2016 International Symposium on Fundamentals of Electrical Engineering (ISFEE)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Speech recognition based Radon-Discrete Cosine Transforms by Delta Neural Network learning rule\",\"authors\":\"Osama Q. Al-Thahab\",\"doi\":\"10.1109/ISFEE.2016.7803208\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The recognition of seven recorded words in this paper is proposed by using a Radon and Discrete Cosine Transforms. The Radon Transform is used to reorder the data input with a new shape so that each voice maintain the same number of samples approximately, while the second frequency transform is used to minimize the dimensions of each audio signal to a small number of samples. The goal of this method is to rise the number of recognized audio signals and consequently increasing the database. The learning rule of Delta Neural Network is used for recognition with the assistance of multi neurons of a single layer such that the number of audio signals (recorded words) are coincide the number of neurons. The results will be compared based on the learning speed. The proposed system also examined by a test audio signal. Here, seven different words are recorded. Eight different persons (men and women) recorded these different words, so that there are 56 audio signal. Each eight signals belongs to the selfsame word; consequently, the outputs of these eight audio signals are the same.\",\"PeriodicalId\":240170,\"journal\":{\"name\":\"2016 International Symposium on Fundamentals of Electrical Engineering (ISFEE)\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Symposium on Fundamentals of Electrical Engineering (ISFEE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISFEE.2016.7803208\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Symposium on Fundamentals of Electrical Engineering (ISFEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISFEE.2016.7803208","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Speech recognition based Radon-Discrete Cosine Transforms by Delta Neural Network learning rule
The recognition of seven recorded words in this paper is proposed by using a Radon and Discrete Cosine Transforms. The Radon Transform is used to reorder the data input with a new shape so that each voice maintain the same number of samples approximately, while the second frequency transform is used to minimize the dimensions of each audio signal to a small number of samples. The goal of this method is to rise the number of recognized audio signals and consequently increasing the database. The learning rule of Delta Neural Network is used for recognition with the assistance of multi neurons of a single layer such that the number of audio signals (recorded words) are coincide the number of neurons. The results will be compared based on the learning speed. The proposed system also examined by a test audio signal. Here, seven different words are recorded. Eight different persons (men and women) recorded these different words, so that there are 56 audio signal. Each eight signals belongs to the selfsame word; consequently, the outputs of these eight audio signals are the same.