基于人听觉系统的高容量对数音频水印

2012 IEEE International Symposium on Multimedia Pub Date : 2012-12-10 DOI:10.1109/ISM.2012.13

Mehdi Fallahpour, D. Megías

{"title":"基于人听觉系统的高容量对数音频水印","authors":"Mehdi Fallahpour, D. Megías","doi":"10.1109/ISM.2012.13","DOIUrl":null,"url":null,"abstract":"This paper proposes a high capacity audio watermarking algorithm in the logarithm domain based on the absolute threshold of hearing (ATH) of the human auditory system (HAS) which makes this scheme a novel technique. The key idea is to divide the selected frequency band into short frames and quantize the samples based on the HAS. Apart from remarkable capacity, transparency and robustness, this scheme provides three parameters (frequency band, scale factor, and frame size) which facilitate the regulation of the watermarking properties. The experimental results show that the method has a high capacity (800 to 7000 bits per second), without significant perceptual distortion (ODG is greater than - 1) and provides robustness against common audio signal processing such as added noise, filtering and MPEG compression (MP3).","PeriodicalId":282528,"journal":{"name":"2012 IEEE International Symposium on Multimedia","volume":"114 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"High Capacity Logarithmic Audio Watermarking Based on the Human Auditory System\",\"authors\":\"Mehdi Fallahpour, D. Megías\",\"doi\":\"10.1109/ISM.2012.13\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a high capacity audio watermarking algorithm in the logarithm domain based on the absolute threshold of hearing (ATH) of the human auditory system (HAS) which makes this scheme a novel technique. The key idea is to divide the selected frequency band into short frames and quantize the samples based on the HAS. Apart from remarkable capacity, transparency and robustness, this scheme provides three parameters (frequency band, scale factor, and frame size) which facilitate the regulation of the watermarking properties. The experimental results show that the method has a high capacity (800 to 7000 bits per second), without significant perceptual distortion (ODG is greater than - 1) and provides robustness against common audio signal processing such as added noise, filtering and MPEG compression (MP3).\",\"PeriodicalId\":282528,\"journal\":{\"name\":\"2012 IEEE International Symposium on Multimedia\",\"volume\":\"114 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-12-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE International Symposium on Multimedia\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISM.2012.13\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Symposium on Multimedia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISM.2012.13","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

摘要

本文提出了一种基于听觉绝对阈值的对数域高容量音频水印算法，使该算法成为一种新技术。其关键思想是将所选频带划分为短帧，并基于HAS对采样进行量化。该方案除了具有显著的容量、透明度和鲁棒性外，还提供了三个参数(频带、比例因子和帧大小)，便于对水印特性进行调节。实验结果表明，该方法具有高容量(800 ~ 7000比特/秒)，没有明显的感知失真(ODG大于- 1)，并且对常见的音频信号处理(如添加噪声、滤波和MPEG压缩(MP3))具有鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

High Capacity Logarithmic Audio Watermarking Based on the Human Auditory System

This paper proposes a high capacity audio watermarking algorithm in the logarithm domain based on the absolute threshold of hearing (ATH) of the human auditory system (HAS) which makes this scheme a novel technique. The key idea is to divide the selected frequency band into short frames and quantize the samples based on the HAS. Apart from remarkable capacity, transparency and robustness, this scheme provides three parameters (frequency band, scale factor, and frame size) which facilitate the regulation of the watermarking properties. The experimental results show that the method has a high capacity (800 to 7000 bits per second), without significant perceptual distortion (ODG is greater than - 1) and provides robustness against common audio signal processing such as added noise, filtering and MPEG compression (MP3).

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2012 IEEE International Symposium on Multimedia

自引率

0.00%

发文量