Quality Estimation of Noisy Speech Using Spectral Entropy Distance

Gabriel Mittag, S. Möller
{"title":"Quality Estimation of Noisy Speech Using Spectral Entropy Distance","authors":"Gabriel Mittag, S. Möller","doi":"10.1109/ICT.2019.8798783","DOIUrl":null,"url":null,"abstract":"In this paper, we propose to use spectral entropy distance as a new measure for objective quality estimations of noisy speech. While the perceived quality estimation of a transmitted speech signal under background noise is fairly straight forward, the estimation of noise on active speech is more complex. For example, an increase in loudness can be confused as noise by common quality measures. Also, other distortions, such as interruptions due to packet loss, can decrease the energy in the degraded signal and thus lead to an underestimation of the noisiness. This is especially critical when the noise is only present during active speech segments, as it is the case for quantization noise caused by low bitrate codecs or voice activity detections at the receiver side. The spectral entropy, however, only considers the frequency composition of a signal and does not depend on the signal energy. Therefore, it gives a robust measure of how noisy a signal is in the presence of active speech. In our experiments, we trained a prediction model based on the spectral entropy and obtained excellent prediction results that show that the spectral entropy distance is indeed a useful tool for the quality estimation of noisy speech.","PeriodicalId":127412,"journal":{"name":"2019 26th International Conference on Telecommunications (ICT)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 26th International Conference on Telecommunications (ICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICT.2019.8798783","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In this paper, we propose to use spectral entropy distance as a new measure for objective quality estimations of noisy speech. While the perceived quality estimation of a transmitted speech signal under background noise is fairly straight forward, the estimation of noise on active speech is more complex. For example, an increase in loudness can be confused as noise by common quality measures. Also, other distortions, such as interruptions due to packet loss, can decrease the energy in the degraded signal and thus lead to an underestimation of the noisiness. This is especially critical when the noise is only present during active speech segments, as it is the case for quantization noise caused by low bitrate codecs or voice activity detections at the receiver side. The spectral entropy, however, only considers the frequency composition of a signal and does not depend on the signal energy. Therefore, it gives a robust measure of how noisy a signal is in the presence of active speech. In our experiments, we trained a prediction model based on the spectral entropy and obtained excellent prediction results that show that the spectral entropy distance is indeed a useful tool for the quality estimation of noisy speech.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于谱熵距离的含噪语音质量估计
本文提出将谱熵距离作为噪声语音客观质量估计的一种新测度。背景噪声下传输语音信号的感知质量估计比较简单,而主动语音信号的感知质量估计则比较复杂。例如,通过普通的质量测量,声音的增加可能会被混淆为噪音。此外,其他失真,如由于丢包而导致的中断,可以减少降级信号中的能量,从而导致对噪声的低估。当噪声仅在活动语音段中存在时,这一点尤其重要,因为这是由低比特率编解码器或接收端语音活动检测引起的量化噪声的情况。而谱熵只考虑信号的频率组成,不依赖于信号的能量。因此,它提供了一种鲁棒的方法来衡量在主动语音存在的情况下信号的噪声。在我们的实验中,我们训练了一个基于谱熵的预测模型,并获得了良好的预测结果,表明谱熵距离确实是一个有用的工具,用于噪声语音的质量估计。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Dual-polarized Antennas Based Directional Modulation Scheme Sliding-Window Processing of Turbo Equalization for Partial Response Channels Feature fusion by using LBP, HOG, GIST descriptors and Canonical Correlation Analysis for face recognition Periodic Time Series Data Classification By Deep Neural Network SFDS: A Self-Feedback Detection System for DNS Hijacking Based on Multi-Protocol Cross Validation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1