{"title":"基于深度学习的鲁棒音频水印技术对抗篡改攻击","authors":"Shuangbing Wen;Qishan Zhang;Tao Hu;Jun Li","doi":"10.1109/LSP.2024.3501285","DOIUrl":null,"url":null,"abstract":"Artificial intelligence technology has been developing rapidly, and speech synthesis models have become increasingly mature, capable of generating highly realistic synthetic audio used to disseminate misinformation, which poses a serious security risk problem. Digital watermarking technology can effectively protect digital content. Deep learning is currently achieving significant research success in digital watermarking. However, the current robustness against audio manipulation remains understudied. Based on this, we propose a robust audio watermarking method based on deep learning against manipulation attacks. Specifically, the embedding of watermarking information is performed in the encoder and the extraction of watermarking information is performed in the decoder; In addition, various audio attacks are simulated during iterative training, a sampling noise layer is used to increase robustness, and a discriminator is used to distinguish between encoded audio and original audio to improve the invisibility of the watermark. We comprehensively evaluate the performance of our model against various manipulation attacks. Experimental results demonstrate that the framework effectively embeds and extracts watermarked signals, exhibiting strong robustness.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"32 ","pages":"126-130"},"PeriodicalIF":3.2000,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Robust Audio Watermarking Against Manipulation Attacks Based on Deep Learning\",\"authors\":\"Shuangbing Wen;Qishan Zhang;Tao Hu;Jun Li\",\"doi\":\"10.1109/LSP.2024.3501285\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Artificial intelligence technology has been developing rapidly, and speech synthesis models have become increasingly mature, capable of generating highly realistic synthetic audio used to disseminate misinformation, which poses a serious security risk problem. Digital watermarking technology can effectively protect digital content. Deep learning is currently achieving significant research success in digital watermarking. However, the current robustness against audio manipulation remains understudied. Based on this, we propose a robust audio watermarking method based on deep learning against manipulation attacks. Specifically, the embedding of watermarking information is performed in the encoder and the extraction of watermarking information is performed in the decoder; In addition, various audio attacks are simulated during iterative training, a sampling noise layer is used to increase robustness, and a discriminator is used to distinguish between encoded audio and original audio to improve the invisibility of the watermark. We comprehensively evaluate the performance of our model against various manipulation attacks. Experimental results demonstrate that the framework effectively embeds and extracts watermarked signals, exhibiting strong robustness.\",\"PeriodicalId\":13154,\"journal\":{\"name\":\"IEEE Signal Processing Letters\",\"volume\":\"32 \",\"pages\":\"126-130\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-11-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Signal Processing Letters\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10756708/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Signal Processing Letters","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10756708/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Robust Audio Watermarking Against Manipulation Attacks Based on Deep Learning
Artificial intelligence technology has been developing rapidly, and speech synthesis models have become increasingly mature, capable of generating highly realistic synthetic audio used to disseminate misinformation, which poses a serious security risk problem. Digital watermarking technology can effectively protect digital content. Deep learning is currently achieving significant research success in digital watermarking. However, the current robustness against audio manipulation remains understudied. Based on this, we propose a robust audio watermarking method based on deep learning against manipulation attacks. Specifically, the embedding of watermarking information is performed in the encoder and the extraction of watermarking information is performed in the decoder; In addition, various audio attacks are simulated during iterative training, a sampling noise layer is used to increase robustness, and a discriminator is used to distinguish between encoded audio and original audio to improve the invisibility of the watermark. We comprehensively evaluate the performance of our model against various manipulation attacks. Experimental results demonstrate that the framework effectively embeds and extracts watermarked signals, exhibiting strong robustness.
期刊介绍:
The IEEE Signal Processing Letters is a monthly, archival publication designed to provide rapid dissemination of original, cutting-edge ideas and timely, significant contributions in signal, image, speech, language and audio processing. Papers published in the Letters can be presented within one year of their appearance in signal processing conferences such as ICASSP, GlobalSIP and ICIP, and also in several workshop organized by the Signal Processing Society.