Advances in speech and audio processing and coding

2015 6th International Conference on Information, Intelligence, Systems and Applications (IISA) Pub Date : 2015-07-06 DOI:10.1109/IISA.2015.7388064

A. Spanias

{"title":"Advances in speech and audio processing and coding","authors":"A. Spanias","doi":"10.1109/IISA.2015.7388064","DOIUrl":null,"url":null,"abstract":"This plenary session will cover speech processing research advances with the emphasis on speech and audio coding methods. In the session, we will discuss the fundamental principles, techniques, and algorithms used in current coding applications including a summary of codecs for telecommunication standards. The session will start with a discussion on: the basic speech representation methods, the performance measures used to evaluate coded speech, and the role of the standards. Brief algorithm descriptions include: ADPCM, sub-band coding, adaptive transform coding, sinusoidal transform coding (STC), linear predictive coding (LPC), and analysis-by-synthesis LPC (sparse excitation, code excited LPC, and ACELP). The presentation will feature audio, and computer demonstrations of recent speech coding standards including voice-over IP algorithms. The plenary session will also cover wideband audio standards such as MPEG audio and other layers (e.g., MP3, AAC). Recent algorithms will also be described including the following: Variable-Rate Multimode Wideband (VMR-WB), Speex, G722.1, OGG Vorbis 2012, iLBC, SELT, SILK, Opus 2013, Qualcomm wideband 5G codecs. At the end of the session, we will cover briefly recent applications that use voice features for detecting speech pathologies, and also discuss how long-term speech parameters can be used as predictors of other diseases such as tremors, Alzheimer's etc.","PeriodicalId":433872,"journal":{"name":"2015 6th International Conference on Information, Intelligence, Systems and Applications (IISA)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 6th International Conference on Information, Intelligence, Systems and Applications (IISA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IISA.2015.7388064","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

This plenary session will cover speech processing research advances with the emphasis on speech and audio coding methods. In the session, we will discuss the fundamental principles, techniques, and algorithms used in current coding applications including a summary of codecs for telecommunication standards. The session will start with a discussion on: the basic speech representation methods, the performance measures used to evaluate coded speech, and the role of the standards. Brief algorithm descriptions include: ADPCM, sub-band coding, adaptive transform coding, sinusoidal transform coding (STC), linear predictive coding (LPC), and analysis-by-synthesis LPC (sparse excitation, code excited LPC, and ACELP). The presentation will feature audio, and computer demonstrations of recent speech coding standards including voice-over IP algorithms. The plenary session will also cover wideband audio standards such as MPEG audio and other layers (e.g., MP3, AAC). Recent algorithms will also be described including the following: Variable-Rate Multimode Wideband (VMR-WB), Speex, G722.1, OGG Vorbis 2012, iLBC, SELT, SILK, Opus 2013, Qualcomm wideband 5G codecs. At the end of the session, we will cover briefly recent applications that use voice features for detecting speech pathologies, and also discuss how long-term speech parameters can be used as predictors of other diseases such as tremors, Alzheimer's etc.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

语音和音频处理和编码的进展

本次全体会议将介绍语音处理的研究进展，重点是语音和音频编码方法。在会议中，我们将讨论当前编码应用中使用的基本原理，技术和算法，包括电信标准的编解码器摘要。会议将首先讨论:基本的语音表示方法，用于评估编码语音的性能指标，以及标准的作用。简要的算法描述包括:ADPCM、子带编码、自适应变换编码、正弦变换编码(STC)、线性预测编码(LPC)和合成分析LPC(稀疏激励、码激励LPC和ACELP)。该演讲将以音频和计算机演示最新的语音编码标准，包括语音IP算法。全体会议还将讨论宽带音频标准，如MPEG音频和其他层(如MP3、AAC)。最新的算法还包括:可变速率多模宽带(VMR-WB)、Speex、G722.1、OGG Vorbis 2012、iLBC、SELT、SILK、Opus 2013、高通宽带5G编解码器。在会议结束时，我们将简要介绍使用语音特征检测语言病理的最新应用，并讨论如何将长期语音参数用作其他疾病(如震颤，阿尔茨海默氏症等)的预测因子。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2015 6th International Conference on Information, Intelligence, Systems and Applications (IISA)

自引率

0.00%

发文量