词汇量大的识别系统中的说话人自适应

F. Class, P. Regel, K. Trottler
{"title":"词汇量大的识别系统中的说话人自适应","authors":"F. Class, P. Regel, K. Trottler","doi":"10.1109/MELCON.1989.50027","DOIUrl":null,"url":null,"abstract":"Algorithms for a fast speaker adaptation in a speech-recognition system are described. The techniques aim at transformations of the feature vectors, which have to be optimized with respect to some constraints. The methods transform every feature vector, computed in a 10-ms frame rate, into a speaker-normalized vector. The advantage of adaptation by transforming the feature vectors is that this procedure can be applied no matter which classification scheme is used. It is shown that, by means of adaptation procedures based on statistical correlation analysis, error rates as low as those of a speaker-dependent recognition system can be achieved after an extremely short training phase with any new speaker. The key is that the feature vectors are extended nonlinearly to a polynomial vector of second or higher order. Since the algorithms necessary for calculating the transformation matrices are typical for signal processing a real-time implementation on digital signal processors appears feasible.<<ETX>>","PeriodicalId":380214,"journal":{"name":"Proceedings. Electrotechnical Conference Integrating Research, Industry and Education in Energy and Communication Engineering',","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1989-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Speaker adaptation for recognition systems with a large vocabulary\",\"authors\":\"F. Class, P. Regel, K. Trottler\",\"doi\":\"10.1109/MELCON.1989.50027\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Algorithms for a fast speaker adaptation in a speech-recognition system are described. The techniques aim at transformations of the feature vectors, which have to be optimized with respect to some constraints. The methods transform every feature vector, computed in a 10-ms frame rate, into a speaker-normalized vector. The advantage of adaptation by transforming the feature vectors is that this procedure can be applied no matter which classification scheme is used. It is shown that, by means of adaptation procedures based on statistical correlation analysis, error rates as low as those of a speaker-dependent recognition system can be achieved after an extremely short training phase with any new speaker. The key is that the feature vectors are extended nonlinearly to a polynomial vector of second or higher order. Since the algorithms necessary for calculating the transformation matrices are typical for signal processing a real-time implementation on digital signal processors appears feasible.<<ETX>>\",\"PeriodicalId\":380214,\"journal\":{\"name\":\"Proceedings. Electrotechnical Conference Integrating Research, Industry and Education in Energy and Communication Engineering',\",\"volume\":\"27 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1989-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings. Electrotechnical Conference Integrating Research, Industry and Education in Energy and Communication Engineering',\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MELCON.1989.50027\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. Electrotechnical Conference Integrating Research, Industry and Education in Energy and Communication Engineering',","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MELCON.1989.50027","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

介绍了语音识别系统中快速自适应说话人的算法。这些技术的目标是特征向量的变换,必须根据一些约束条件对其进行优化。该方法将以10ms帧速率计算的每个特征向量转换为说话人归一化向量。通过变换特征向量进行自适应的优点是,无论采用哪种分类方案,该方法都可以适用。研究表明,通过基于统计相关分析的自适应过程,在对任何新说话人进行极短的训练阶段后,都可以达到与依赖说话人识别系统一样低的错误率。关键是将特征向量非线性地扩展为二阶或更高阶的多项式向量。由于计算变换矩阵所需的算法是信号处理的典型算法,因此在数字信号处理器上实时实现似乎是可行的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Speaker adaptation for recognition systems with a large vocabulary
Algorithms for a fast speaker adaptation in a speech-recognition system are described. The techniques aim at transformations of the feature vectors, which have to be optimized with respect to some constraints. The methods transform every feature vector, computed in a 10-ms frame rate, into a speaker-normalized vector. The advantage of adaptation by transforming the feature vectors is that this procedure can be applied no matter which classification scheme is used. It is shown that, by means of adaptation procedures based on statistical correlation analysis, error rates as low as those of a speaker-dependent recognition system can be achieved after an extremely short training phase with any new speaker. The key is that the feature vectors are extended nonlinearly to a polynomial vector of second or higher order. Since the algorithms necessary for calculating the transformation matrices are typical for signal processing a real-time implementation on digital signal processors appears feasible.<>
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Hydrothermal scheduling of power system with stochastic inflows A multiple user variable rate speech coding for packet transmission systems Software engineering: how to approach it the electrical engineer Crosstalk in digital transmission over subscriber cable Developments in power converters
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1