P. Rodríguez-Peralta, M. Nakano-Miyatake, H. Perez-Meana, G. Duchén-Sánchez
{"title":"语音信号的时间尺度算法,以帮助学习一门外语","authors":"P. Rodríguez-Peralta, M. Nakano-Miyatake, H. Perez-Meana, G. Duchén-Sánchez","doi":"10.1109/ISIE.2000.930363","DOIUrl":null,"url":null,"abstract":"For basic level students of foreign languages, normal speed speaking is very difficult to understand perfectly, because of lack of training in understanding of oral language. However when the speed of speaking slows down, in most cases understanding increases. This fact suggests that to improve learning of the foreign language, it is necessary that students can adjust the speed of speaking according to their own understanding level. This paper presents a comparison of two time scaling algorithms when they are used to assist learning of a foreign language. Both algorithms consist of a pitch detection stage and time scaling stage. The pitch detection of both algorithms is based on autocorrelation method of the speech signals proposed by Rabiner et. al. (1976). The time scaling in the first method consists in duplicating the pitch periods of voiced segments while keeping unchanged unvoiced ones. The second method is based on the short time Fourier transform. Experimental results, MOS (mean opinion scoring) are given using Spanish, French, German, Russian, Japanese and Italian which show desirable features of both time scaling algorithms when they are used to assist the students to learn foreign languages. The performance of both algorithms when the pitch detection stage has some noise is also shown.","PeriodicalId":298625,"journal":{"name":"ISIE'2000. Proceedings of the 2000 IEEE International Symposium on Industrial Electronics (Cat. No.00TH8543)","volume":"185 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Time scaling algorithm of speech signal to assist learning of a foreign language\",\"authors\":\"P. Rodríguez-Peralta, M. Nakano-Miyatake, H. Perez-Meana, G. Duchén-Sánchez\",\"doi\":\"10.1109/ISIE.2000.930363\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For basic level students of foreign languages, normal speed speaking is very difficult to understand perfectly, because of lack of training in understanding of oral language. However when the speed of speaking slows down, in most cases understanding increases. This fact suggests that to improve learning of the foreign language, it is necessary that students can adjust the speed of speaking according to their own understanding level. This paper presents a comparison of two time scaling algorithms when they are used to assist learning of a foreign language. Both algorithms consist of a pitch detection stage and time scaling stage. The pitch detection of both algorithms is based on autocorrelation method of the speech signals proposed by Rabiner et. al. (1976). The time scaling in the first method consists in duplicating the pitch periods of voiced segments while keeping unchanged unvoiced ones. The second method is based on the short time Fourier transform. Experimental results, MOS (mean opinion scoring) are given using Spanish, French, German, Russian, Japanese and Italian which show desirable features of both time scaling algorithms when they are used to assist the students to learn foreign languages. The performance of both algorithms when the pitch detection stage has some noise is also shown.\",\"PeriodicalId\":298625,\"journal\":{\"name\":\"ISIE'2000. Proceedings of the 2000 IEEE International Symposium on Industrial Electronics (Cat. No.00TH8543)\",\"volume\":\"185 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2000-12-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ISIE'2000. Proceedings of the 2000 IEEE International Symposium on Industrial Electronics (Cat. No.00TH8543)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISIE.2000.930363\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISIE'2000. Proceedings of the 2000 IEEE International Symposium on Industrial Electronics (Cat. No.00TH8543)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISIE.2000.930363","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Time scaling algorithm of speech signal to assist learning of a foreign language
For basic level students of foreign languages, normal speed speaking is very difficult to understand perfectly, because of lack of training in understanding of oral language. However when the speed of speaking slows down, in most cases understanding increases. This fact suggests that to improve learning of the foreign language, it is necessary that students can adjust the speed of speaking according to their own understanding level. This paper presents a comparison of two time scaling algorithms when they are used to assist learning of a foreign language. Both algorithms consist of a pitch detection stage and time scaling stage. The pitch detection of both algorithms is based on autocorrelation method of the speech signals proposed by Rabiner et. al. (1976). The time scaling in the first method consists in duplicating the pitch periods of voiced segments while keeping unchanged unvoiced ones. The second method is based on the short time Fourier transform. Experimental results, MOS (mean opinion scoring) are given using Spanish, French, German, Russian, Japanese and Italian which show desirable features of both time scaling algorithms when they are used to assist the students to learn foreign languages. The performance of both algorithms when the pitch detection stage has some noise is also shown.