A real-time tone enhancement method for continuous Mandarin speeches

Ye Tian, Jia Jia, Yongxin Wang, Lianhong Cai
{"title":"A real-time tone enhancement method for continuous Mandarin speeches","authors":"Ye Tian, Jia Jia, Yongxin Wang, Lianhong Cai","doi":"10.1109/ISCSLP.2012.6423534","DOIUrl":null,"url":null,"abstract":"Chinese Mandarin is a tonal language. Tone perception ability of people with sensorineural hearing loss (SNHL) is often weaker than normal people. To help the SNHL people better perceive and distinguish tone information in Chinese speech, we focus on real-time tone enhancement method for mandarin continuous speeches. In this paper, based on the experimental investigation on the acoustic features most related to tone perception, we propose a practical tone enhancing model which employs the unified features independent of Chinese tonal patterns. Using this model, we further implement a real-time tone enhancement method which can avoid syllable segmentation and tonal pattern recognition. By the tone identification test for the normal and SNHL people under both quiet and noisy backgrounds, it is found that the enhanced speeches with the proposed method gains an average 5% higher correct rate compared to original speeches. And the time delay of the enhancement method can be controlled within 800ms, which can be further used in hearing aids to benefit the SNHL people in their daily life.","PeriodicalId":186099,"journal":{"name":"2012 8th International Symposium on Chinese Spoken Language Processing","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 8th International Symposium on Chinese Spoken Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCSLP.2012.6423534","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Chinese Mandarin is a tonal language. Tone perception ability of people with sensorineural hearing loss (SNHL) is often weaker than normal people. To help the SNHL people better perceive and distinguish tone information in Chinese speech, we focus on real-time tone enhancement method for mandarin continuous speeches. In this paper, based on the experimental investigation on the acoustic features most related to tone perception, we propose a practical tone enhancing model which employs the unified features independent of Chinese tonal patterns. Using this model, we further implement a real-time tone enhancement method which can avoid syllable segmentation and tonal pattern recognition. By the tone identification test for the normal and SNHL people under both quiet and noisy backgrounds, it is found that the enhanced speeches with the proposed method gains an average 5% higher correct rate compared to original speeches. And the time delay of the enhancement method can be controlled within 800ms, which can be further used in hearing aids to benefit the SNHL people in their daily life.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
普通话连续演讲的实时语气增强方法
汉语普通话是一种声调语言。感音神经性听力损失(SNHL)患者的音调感知能力往往弱于正常人。为了帮助SNHL人群更好地感知和区分汉语语音中的语调信息,我们重点研究了汉语连续语音的实时语调增强方法。本文在对与声调感知最相关的声学特征进行实验研究的基础上,提出了一种独立于汉语声调模式的统一特征的实用的声调增强模型。利用该模型,我们进一步实现了一种实时的音调增强方法,该方法可以避免音节分割和音调模式识别。通过对安静和嘈杂背景下正常人和SNHL人群的语音识别测试,发现采用该方法增强的语音比原始语音的正确率平均提高了5%。增强方法的延时可控制在800ms以内,可进一步应用于助听器,使SNHL患者在日常生活中受益。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Noise-robust whispered speech recognition using a non-audible-murmur microphone with VTS compensation Effects of excitation spread on the intelligibility of Mandarin speech in cochlear implant simulations A comparative study of fMPE and RDLT approaches to LVCSR Keyword-specific normalization based keyword spotting for spontaneous speech A unified trajectory tiling approach to high quality TTS and cross-lingual voice transformation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1