LabVIEW and digital signal processor implementation of a channel vocoder based model of a cochlear implant

G. Rachel, S. J. J. Singh, P. Vijayalakshmi
{"title":"LabVIEW and digital signal processor implementation of a channel vocoder based model of a cochlear implant","authors":"G. Rachel, S. J. J. Singh, P. Vijayalakshmi","doi":"10.1109/ICRTIT.2013.6844195","DOIUrl":null,"url":null,"abstract":"A cochlear implant is a prosthetic device used to mimic the function of a cochlea in a person with profound and bilateral hearing loss caused by a damaged inner ear. The current work revolves around the design of real time channel vocoder based model of a cochlear implant in LabVIEW and the TMS320C6713 DSK. First, a uniform band 16-channel vocoder is designed for the analysis and synthesis of English vowels, where filters of 400 Hz bandwidth, with cut off frequencies up to 6200Hz are used, based on MATLAB analysis performed previously. To extend the analysis to words and sentences, short time features, namely, short time energy, short time zero crossing rate and main lobe width of the short time autocorrelation function, are extracted and Gaussian Mixture Modelling (GMM) is used to classify the speech segments as voiced or unvoiced. In an attempt to make the synthetic speech sound natural, the synthesis in the channel vocoder is done using a train of glottal pulses instead of a train of impulses. The intelligibility of the synthetic speech is measured by the Mean Opinion Score (MOS). For a channel vocoder where the synthesis section uses a train of glottal pulses, an MOS of 3.6 is obtained as against 3.5 when a train of impulses is used. The lab model of the cochlear implant, that is, the analysis section of the 16-channel vocoder is then realised in the TMS320C6713 DSK. Individual channel outputs are obtained by programming the DIP switches of the DSK and a DSK_AUDIO16_BASE, a 16-channel audio daughter card, is interfaced with the DSK to obtain the outputs from multiple channels simultaneously.","PeriodicalId":113531,"journal":{"name":"2013 International Conference on Recent Trends in Information Technology (ICRTIT)","volume":"39 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 International Conference on Recent Trends in Information Technology (ICRTIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRTIT.2013.6844195","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

A cochlear implant is a prosthetic device used to mimic the function of a cochlea in a person with profound and bilateral hearing loss caused by a damaged inner ear. The current work revolves around the design of real time channel vocoder based model of a cochlear implant in LabVIEW and the TMS320C6713 DSK. First, a uniform band 16-channel vocoder is designed for the analysis and synthesis of English vowels, where filters of 400 Hz bandwidth, with cut off frequencies up to 6200Hz are used, based on MATLAB analysis performed previously. To extend the analysis to words and sentences, short time features, namely, short time energy, short time zero crossing rate and main lobe width of the short time autocorrelation function, are extracted and Gaussian Mixture Modelling (GMM) is used to classify the speech segments as voiced or unvoiced. In an attempt to make the synthetic speech sound natural, the synthesis in the channel vocoder is done using a train of glottal pulses instead of a train of impulses. The intelligibility of the synthetic speech is measured by the Mean Opinion Score (MOS). For a channel vocoder where the synthesis section uses a train of glottal pulses, an MOS of 3.6 is obtained as against 3.5 when a train of impulses is used. The lab model of the cochlear implant, that is, the analysis section of the 16-channel vocoder is then realised in the TMS320C6713 DSK. Individual channel outputs are obtained by programming the DIP switches of the DSK and a DSK_AUDIO16_BASE, a 16-channel audio daughter card, is interfaced with the DSK to obtain the outputs from multiple channels simultaneously.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
LabVIEW和数字信号处理器实现了一种基于通道声码器的人工耳蜗模型
人工耳蜗是一种假体装置,用于模仿由内耳受损引起的重度和双侧听力损失的人的耳蜗功能。目前的工作围绕着基于LabVIEW和TMS320C6713 DSK的人工耳蜗实时通道声码器模型的设计展开。首先,基于之前的MATLAB分析,设计了一个统一频带的16通道声码器,用于分析和合成英语元音,其中滤波器带宽为400 Hz,截止频率高达6200Hz。为了将分析扩展到单词和句子,提取短时间特征,即短时间能量、短时间过零率和短时间自相关函数的主瓣宽度,并使用高斯混合建模(GMM)将语音片段分类为浊音或非浊音。为了使合成语音听起来自然,通道声码器中的合成使用声门脉冲序列而不是脉冲序列来完成。合成语音的可理解性用平均意见评分(Mean Opinion Score, MOS)来衡量。对于通道声码器,其中合成部分使用声门脉冲序列,当使用脉冲序列时,获得3.6的MOS,而不是3.5。然后在TMS320C6713 DSK中实现人工耳蜗的实验室模型,即16通道声码器的分析部分。通过编程DSK的拨码开关获得单个通道输出,DSK_AUDIO16_BASE,一个16通道音频子卡,与DSK接口,同时获得多个通道的输出。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Secure AODV to combat black hole attack in MANET Position aware energy efficient multicast routing in MANET Evolutionary optimization in ANFIS for intelligent navigation system Voice based login authentication for Linux Information extraction and unfilled-form structure retrieval from filled-up forms
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1