A Pruned-CELP Speech Codec Using Denoising Autoencoder with Spectral Compensation for Quality and Intelligibility Enhancement

2019 IEEE International Conference on Artificial Intelligence Circuits and Systems (AICAS). Pub Date: 2019-03-01. DOI: 10.1109/AICAS.2019.8771507

Yu-Ting Lo, Syu-Siang Wang, Yu Tsao, Sheng-Yu Peng

Citations: 1

Abstract

This paper proposes a speech codec based on code-excited linear prediction (CELP) that adopts a denoising autoencoder with spectral compensation (DAE-SC) to enhance quality and intelligibility. The sizes of the CELP parameters in the encoder are carefully pruned to achieve a higher compression rate. To recover the speech quality and intelligibility lost to the pruned CELP parameters, a DAE-SC network with three hidden layers is employed in the decoder. Compared with a conventional CELP codec at a 9.6 kbps transmission rate, the proposed codec achieves an additional 21.9% bit-rate reduction with comparable speech quality and intelligibility, as evaluated by four commonly used speech performance metrics.
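To put the headline figure in absolute terms, the short sketch below applies the quoted 21.9% reduction to the 9.6 kbps baseline. It assumes the percentage is measured relative to that baseline rate; the abstract does not state the resulting bit rate, so the value printed here is a derived estimate rather than a number reported by the authors. The function name `reduced_bit_rate` is hypothetical and used only for illustration.

```python
def reduced_bit_rate(baseline_bps: float, reduction_pct: float) -> float:
    """Return the bit rate left after removing reduction_pct percent of baseline_bps."""
    if not 0.0 <= reduction_pct <= 100.0:
        raise ValueError("reduction_pct must be between 0 and 100")
    return baseline_bps * (1.0 - reduction_pct / 100.0)


BASELINE_BPS = 9600.0   # conventional CELP rate quoted in the abstract
REDUCTION_PCT = 21.9    # additional reduction quoted in the abstract

# Assumption: the reduction is relative to the 9.6 kbps baseline.
pruned_bps = reduced_bit_rate(BASELINE_BPS, REDUCTION_PCT)
print(f"{pruned_bps / 1000:.2f} kbps")  # prints 7.50 kbps
```

Under this reading, the pruned codec would operate at roughly 7.5 kbps, with the DAE-SC network in the decoder compensating for the information dropped by the encoder.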

