Time scaling algorithm of speech signal to assist learning of a foreign language

P. Rodríguez-Peralta, M. Nakano-Miyatake, H. Perez-Meana, G. Duchén-Sánchez
{"title":"Time scaling algorithm of speech signal to assist learning of a foreign language","authors":"P. Rodríguez-Peralta, M. Nakano-Miyatake, H. Perez-Meana, G. Duchén-Sánchez","doi":"10.1109/ISIE.2000.930363","DOIUrl":null,"url":null,"abstract":"For basic level students of foreign languages, normal speed speaking is very difficult to understand perfectly, because of lack of training in understanding of oral language. However when the speed of speaking slows down, in most cases understanding increases. This fact suggests that to improve learning of the foreign language, it is necessary that students can adjust the speed of speaking according to their own understanding level. This paper presents a comparison of two time scaling algorithms when they are used to assist learning of a foreign language. Both algorithms consist of a pitch detection stage and time scaling stage. The pitch detection of both algorithms is based on autocorrelation method of the speech signals proposed by Rabiner et. al. (1976). The time scaling in the first method consists in duplicating the pitch periods of voiced segments while keeping unchanged unvoiced ones. The second method is based on the short time Fourier transform. Experimental results, MOS (mean opinion scoring) are given using Spanish, French, German, Russian, Japanese and Italian which show desirable features of both time scaling algorithms when they are used to assist the students to learn foreign languages. The performance of both algorithms when the pitch detection stage has some noise is also shown.","PeriodicalId":298625,"journal":{"name":"ISIE'2000. Proceedings of the 2000 IEEE International Symposium on Industrial Electronics (Cat. No.00TH8543)","volume":"185 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2000-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISIE'2000. Proceedings of the 2000 IEEE International Symposium on Industrial Electronics (Cat. No.00TH8543)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISIE.2000.930363","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

For basic level students of foreign languages, normal speed speaking is very difficult to understand perfectly, because of lack of training in understanding of oral language. However when the speed of speaking slows down, in most cases understanding increases. This fact suggests that to improve learning of the foreign language, it is necessary that students can adjust the speed of speaking according to their own understanding level. This paper presents a comparison of two time scaling algorithms when they are used to assist learning of a foreign language. Both algorithms consist of a pitch detection stage and time scaling stage. The pitch detection of both algorithms is based on autocorrelation method of the speech signals proposed by Rabiner et. al. (1976). The time scaling in the first method consists in duplicating the pitch periods of voiced segments while keeping unchanged unvoiced ones. The second method is based on the short time Fourier transform. Experimental results, MOS (mean opinion scoring) are given using Spanish, French, German, Russian, Japanese and Italian which show desirable features of both time scaling algorithms when they are used to assist the students to learn foreign languages. The performance of both algorithms when the pitch detection stage has some noise is also shown.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
语音信号的时间尺度算法,以帮助学习一门外语
对于外语基础水平的学生来说,由于缺乏对口语理解的训练,正常语速说话很难完全理解。然而,在大多数情况下,当说话速度放慢时,理解能力会提高。这一事实表明,为了提高外语的学习,学生有必要根据自己的理解水平调整说话的速度。本文介绍了两种时间尺度算法在辅助外语学习中的比较。这两种算法都包括一个基音检测阶段和一个时间尺度阶段。两种算法的基音检测都基于Rabiner et. al.(1976)提出的语音信号的自相关方法。第一种方法的时间尺度是复制浊音段的音高周期,同时保持未浊音段不变。第二种方法是基于短时傅里叶变换。实验结果显示,使用西班牙语、法语、德语、俄语、日语和意大利语进行的MOS(平均意见评分)在帮助学生学习外语时显示出两种时间尺度算法的理想特征。给出了两种算法在基音检测阶段存在噪声时的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Engineering of distributed control systems Modeling conducted EMI noise generation and propagation in boost converters Application of hybrid power systems of low power to the remote radio equipment telecommunication Identification of nonlinear systems based on the bispectrum and a second order Volterra model A fuzzy logic feed-forward current controller for PWM rectifiers
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1