Detection of AAC compression using MDCT-based features and supervised learning

IF 1.7 4区 计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Journal of Experimental & Theoretical Artificial Intelligence Pub Date : 2021-01-31 DOI:10.1080/0952813X.2021.1882003
José Juan García-Hernández, W. Gómez-Flores
{"title":"Detection of AAC compression using MDCT-based features and supervised learning","authors":"José Juan García-Hernández, W. Gómez-Flores","doi":"10.1080/0952813X.2021.1882003","DOIUrl":null,"url":null,"abstract":"ABSTRACT Audio files are frequent targets of malicious users who seek illegal profit trading with fake-quality content. For increasing the confidence in the integrity of audio files, the detection of fake-quality content is an important task. This paper proposes a method for detecting Advanced Audio Coding (AAC) compression on suspicious WAV files, in which the variance of the Modified Discrete Cosine Transform (MDCT) characterises four compression bitrates: uncompressed, 64 kbps, 128 kbps, and 256 kbps. This scheme takes advantage of the reduction of the variance of the high-frequency MDCT coefficients in compressed signals. Data obtained from MDCT coefficients generate a high-dimensional feature space. Hence, Principal Component Analysis, followed by Linear Discriminant Analysis, is used for projecting the high-dimensional data onto a lower-dimensional space. Besides, six supervised learning algorithms are compared for classifying four compression bitrates. The experiments show that using audio samples with 20 seconds and 1024 MDCT coefficients, an accuracy of 93% is reached with a Bayesian classifier. Collaterally, the detection between uncompressed and compressed signals attains an accuracy of 97% with Multinomial Logistic Regression. In conclusion, the proposed approach can detect previous AAC compression and can be potentially used when it is unfeasible to recover the suspicious signal completely.","PeriodicalId":15677,"journal":{"name":"Journal of Experimental & Theoretical Artificial Intelligence","volume":"22 1","pages":"451 - 468"},"PeriodicalIF":1.7000,"publicationDate":"2021-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Experimental & Theoretical Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1080/0952813X.2021.1882003","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 1

Abstract

ABSTRACT Audio files are frequent targets of malicious users who seek illegal profit trading with fake-quality content. For increasing the confidence in the integrity of audio files, the detection of fake-quality content is an important task. This paper proposes a method for detecting Advanced Audio Coding (AAC) compression on suspicious WAV files, in which the variance of the Modified Discrete Cosine Transform (MDCT) characterises four compression bitrates: uncompressed, 64 kbps, 128 kbps, and 256 kbps. This scheme takes advantage of the reduction of the variance of the high-frequency MDCT coefficients in compressed signals. Data obtained from MDCT coefficients generate a high-dimensional feature space. Hence, Principal Component Analysis, followed by Linear Discriminant Analysis, is used for projecting the high-dimensional data onto a lower-dimensional space. Besides, six supervised learning algorithms are compared for classifying four compression bitrates. The experiments show that using audio samples with 20 seconds and 1024 MDCT coefficients, an accuracy of 93% is reached with a Bayesian classifier. Collaterally, the detection between uncompressed and compressed signals attains an accuracy of 97% with Multinomial Logistic Regression. In conclusion, the proposed approach can detect previous AAC compression and can be potentially used when it is unfeasible to recover the suspicious signal completely.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用基于mdct的特征和监督学习检测AAC压缩
音频文件是恶意用户利用虚假内容进行非法牟利交易的常见目标。为了提高人们对音频文件完整性的信心,对虚假内容的检测是一项重要的任务。本文提出了一种检测可疑WAV文件上的高级音频编码(AAC)压缩的方法,其中修改离散余弦变换(MDCT)的方差具有四种压缩比特率:未压缩、64 kbps、128 kbps和256 kbps。该方案利用了压缩信号中高频MDCT系数方差的减小。从MDCT系数中获得的数据生成高维特征空间。因此,主成分分析,然后是线性判别分析,用于将高维数据投影到低维空间。此外,比较了六种监督学习算法对四种压缩比特率的分类。实验表明,使用20秒的音频样本和1024个MDCT系数,贝叶斯分类器的准确率达到93%。此外,未压缩和压缩信号之间的检测与多项逻辑回归达到97%的精度。综上所述,该方法可以检测到之前的AAC压缩,并且可以在无法完全恢复可疑信号时使用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
6.10
自引率
4.50%
发文量
89
审稿时长
>12 weeks
期刊介绍: Journal of Experimental & Theoretical Artificial Intelligence (JETAI) is a world leading journal dedicated to publishing high quality, rigorously reviewed, original papers in artificial intelligence (AI) research. The journal features work in all subfields of AI research and accepts both theoretical and applied research. Topics covered include, but are not limited to, the following: • cognitive science • games • learning • knowledge representation • memory and neural system modelling • perception • problem-solving
期刊最新文献
Occlusive target recognition method of sorting robot based on anchor-free detection network An effectual underwater image enhancement framework using adaptive trans-resunet ++ with attention mechanism An experimental study of sentiment classification using deep-based models with various word embedding techniques Sign language video to text conversion via optimised LSTM with improved motion estimation An efficient safest route prediction-based route discovery mechanism for drivers using improved golden tortoise beetle optimizer
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1