LabVIEW and digital signal processor implementation of a channel vocoder based model of a cochlear implant

2013 International Conference on Recent Trends in Information Technology (ICRTIT) Pub Date : 2013-07-25 DOI:10.1109/ICRTIT.2013.6844195

G. Rachel, S. J. J. Singh, P. Vijayalakshmi

{"title":"LabVIEW and digital signal processor implementation of a channel vocoder based model of a cochlear implant","authors":"G. Rachel, S. J. J. Singh, P. Vijayalakshmi","doi":"10.1109/ICRTIT.2013.6844195","DOIUrl":null,"url":null,"abstract":"A cochlear implant is a prosthetic device used to mimic the function of a cochlea in a person with profound and bilateral hearing loss caused by a damaged inner ear. The current work revolves around the design of real time channel vocoder based model of a cochlear implant in LabVIEW and the TMS320C6713 DSK. First, a uniform band 16-channel vocoder is designed for the analysis and synthesis of English vowels, where filters of 400 Hz bandwidth, with cut off frequencies up to 6200Hz are used, based on MATLAB analysis performed previously. To extend the analysis to words and sentences, short time features, namely, short time energy, short time zero crossing rate and main lobe width of the short time autocorrelation function, are extracted and Gaussian Mixture Modelling (GMM) is used to classify the speech segments as voiced or unvoiced. In an attempt to make the synthetic speech sound natural, the synthesis in the channel vocoder is done using a train of glottal pulses instead of a train of impulses. The intelligibility of the synthetic speech is measured by the Mean Opinion Score (MOS). For a channel vocoder where the synthesis section uses a train of glottal pulses, an MOS of 3.6 is obtained as against 3.5 when a train of impulses is used. The lab model of the cochlear implant, that is, the analysis section of the 16-channel vocoder is then realised in the TMS320C6713 DSK. Individual channel outputs are obtained by programming the DIP switches of the DSK and a DSK_AUDIO16_BASE, a 16-channel audio daughter card, is interfaced with the DSK to obtain the outputs from multiple channels simultaneously.","PeriodicalId":113531,"journal":{"name":"2013 International Conference on Recent Trends in Information Technology (ICRTIT)","volume":"39 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 International Conference on Recent Trends in Information Technology (ICRTIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICRTIT.2013.6844195","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

A cochlear implant is a prosthetic device used to mimic the function of a cochlea in a person with profound and bilateral hearing loss caused by a damaged inner ear. The current work revolves around the design of real time channel vocoder based model of a cochlear implant in LabVIEW and the TMS320C6713 DSK. First, a uniform band 16-channel vocoder is designed for the analysis and synthesis of English vowels, where filters of 400 Hz bandwidth, with cut off frequencies up to 6200Hz are used, based on MATLAB analysis performed previously. To extend the analysis to words and sentences, short time features, namely, short time energy, short time zero crossing rate and main lobe width of the short time autocorrelation function, are extracted and Gaussian Mixture Modelling (GMM) is used to classify the speech segments as voiced or unvoiced. In an attempt to make the synthetic speech sound natural, the synthesis in the channel vocoder is done using a train of glottal pulses instead of a train of impulses. The intelligibility of the synthetic speech is measured by the Mean Opinion Score (MOS). For a channel vocoder where the synthesis section uses a train of glottal pulses, an MOS of 3.6 is obtained as against 3.5 when a train of impulses is used. The lab model of the cochlear implant, that is, the analysis section of the 16-channel vocoder is then realised in the TMS320C6713 DSK. Individual channel outputs are obtained by programming the DIP switches of the DSK and a DSK_AUDIO16_BASE, a 16-channel audio daughter card, is interfaced with the DSK to obtain the outputs from multiple channels simultaneously.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

LabVIEW和数字信号处理器实现了一种基于通道声码器的人工耳蜗模型

人工耳蜗是一种假体装置，用于模仿由内耳受损引起的重度和双侧听力损失的人的耳蜗功能。目前的工作围绕着基于LabVIEW和TMS320C6713 DSK的人工耳蜗实时通道声码器模型的设计展开。首先，基于之前的MATLAB分析，设计了一个统一频带的16通道声码器，用于分析和合成英语元音，其中滤波器带宽为400 Hz，截止频率高达6200Hz。为了将分析扩展到单词和句子，提取短时间特征，即短时间能量、短时间过零率和短时间自相关函数的主瓣宽度，并使用高斯混合建模(GMM)将语音片段分类为浊音或非浊音。为了使合成语音听起来自然，通道声码器中的合成使用声门脉冲序列而不是脉冲序列来完成。合成语音的可理解性用平均意见评分(Mean Opinion Score, MOS)来衡量。对于通道声码器，其中合成部分使用声门脉冲序列，当使用脉冲序列时，获得3.6的MOS，而不是3.5。然后在TMS320C6713 DSK中实现人工耳蜗的实验室模型，即16通道声码器的分析部分。通过编程DSK的拨码开关获得单个通道输出，DSK_AUDIO16_BASE，一个16通道音频子卡，与DSK接口，同时获得多个通道的输出。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2013 International Conference on Recent Trends in Information Technology (ICRTIT)

自引率

0.00%

发文量

期刊最新文献

Secure AODV to combat black hole attack in MANET Position aware energy efficient multicast routing in MANET Evolutionary optimization in ANFIS for intelligent navigation system Voice based login authentication for Linux Information extraction and unfilled-form structure retrieval from filled-up forms