A real-time tone enhancement method for continuous Mandarin speeches

2012 8th International Symposium on Chinese Spoken Language Processing Pub Date : 2012-12-01 DOI:10.1109/ISCSLP.2012.6423534

Ye Tian, Jia Jia, Yongxin Wang, Lianhong Cai

{"title":"A real-time tone enhancement method for continuous Mandarin speeches","authors":"Ye Tian, Jia Jia, Yongxin Wang, Lianhong Cai","doi":"10.1109/ISCSLP.2012.6423534","DOIUrl":null,"url":null,"abstract":"Chinese Mandarin is a tonal language. Tone perception ability of people with sensorineural hearing loss (SNHL) is often weaker than normal people. To help the SNHL people better perceive and distinguish tone information in Chinese speech, we focus on real-time tone enhancement method for mandarin continuous speeches. In this paper, based on the experimental investigation on the acoustic features most related to tone perception, we propose a practical tone enhancing model which employs the unified features independent of Chinese tonal patterns. Using this model, we further implement a real-time tone enhancement method which can avoid syllable segmentation and tonal pattern recognition. By the tone identification test for the normal and SNHL people under both quiet and noisy backgrounds, it is found that the enhanced speeches with the proposed method gains an average 5% higher correct rate compared to original speeches. And the time delay of the enhancement method can be controlled within 800ms, which can be further used in hearing aids to benefit the SNHL people in their daily life.","PeriodicalId":186099,"journal":{"name":"2012 8th International Symposium on Chinese Spoken Language Processing","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 8th International Symposium on Chinese Spoken Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCSLP.2012.6423534","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Chinese Mandarin is a tonal language. Tone perception ability of people with sensorineural hearing loss (SNHL) is often weaker than normal people. To help the SNHL people better perceive and distinguish tone information in Chinese speech, we focus on real-time tone enhancement method for mandarin continuous speeches. In this paper, based on the experimental investigation on the acoustic features most related to tone perception, we propose a practical tone enhancing model which employs the unified features independent of Chinese tonal patterns. Using this model, we further implement a real-time tone enhancement method which can avoid syllable segmentation and tonal pattern recognition. By the tone identification test for the normal and SNHL people under both quiet and noisy backgrounds, it is found that the enhanced speeches with the proposed method gains an average 5% higher correct rate compared to original speeches. And the time delay of the enhancement method can be controlled within 800ms, which can be further used in hearing aids to benefit the SNHL people in their daily life.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

普通话连续演讲的实时语气增强方法

汉语普通话是一种声调语言。感音神经性听力损失(SNHL)患者的音调感知能力往往弱于正常人。为了帮助SNHL人群更好地感知和区分汉语语音中的语调信息，我们重点研究了汉语连续语音的实时语调增强方法。本文在对与声调感知最相关的声学特征进行实验研究的基础上，提出了一种独立于汉语声调模式的统一特征的实用的声调增强模型。利用该模型，我们进一步实现了一种实时的音调增强方法，该方法可以避免音节分割和音调模式识别。通过对安静和嘈杂背景下正常人和SNHL人群的语音识别测试，发现采用该方法增强的语音比原始语音的正确率平均提高了5%。增强方法的延时可控制在800ms以内，可进一步应用于助听器，使SNHL患者在日常生活中受益。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2012 8th International Symposium on Chinese Spoken Language Processing

自引率

0.00%

发文量