{"title":"利用 CNN 和主成分分析实现音频压缩中的动态差异控制","authors":"Asish Debnath, Uttam Kr. Mondal","doi":"10.1007/s41870-024-02155-8","DOIUrl":null,"url":null,"abstract":"<p>This study addresses challenges arising from large audio file storage needs and rising network bandwidth demands. In this paper, a novel audio codec design is proposed, integrating audio sample segregation, user input variance controlled principal component analysis (PCA), and Convolutional Neural Network (CNN). PCA computes sample variance feature vectors, extracts principal components, and determines compression rates. This method leverages PCA and CNN to compress audio efficiently, yielding high-quality reconstructed audio. Experimental results show that increasing PCA components generally improves PSNR values, while decreasing components may reduce CR, MSE, and other error metrics. The simulation results are analyzed and compared to other existing lossless audio encoding schemes with various statistical and robustness features.</p>","PeriodicalId":14138,"journal":{"name":"International Journal of Information Technology","volume":"3 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Leveraging CNN and principal component analysis for dynamic variance control in audio compression\",\"authors\":\"Asish Debnath, Uttam Kr. Mondal\",\"doi\":\"10.1007/s41870-024-02155-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This study addresses challenges arising from large audio file storage needs and rising network bandwidth demands. In this paper, a novel audio codec design is proposed, integrating audio sample segregation, user input variance controlled principal component analysis (PCA), and Convolutional Neural Network (CNN). PCA computes sample variance feature vectors, extracts principal components, and determines compression rates. This method leverages PCA and CNN to compress audio efficiently, yielding high-quality reconstructed audio. Experimental results show that increasing PCA components generally improves PSNR values, while decreasing components may reduce CR, MSE, and other error metrics. The simulation results are analyzed and compared to other existing lossless audio encoding schemes with various statistical and robustness features.</p>\",\"PeriodicalId\":14138,\"journal\":{\"name\":\"International Journal of Information Technology\",\"volume\":\"3 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Information Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/s41870-024-02155-8\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s41870-024-02155-8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Leveraging CNN and principal component analysis for dynamic variance control in audio compression
This study addresses challenges arising from large audio file storage needs and rising network bandwidth demands. In this paper, a novel audio codec design is proposed, integrating audio sample segregation, user input variance controlled principal component analysis (PCA), and Convolutional Neural Network (CNN). PCA computes sample variance feature vectors, extracts principal components, and determines compression rates. This method leverages PCA and CNN to compress audio efficiently, yielding high-quality reconstructed audio. Experimental results show that increasing PCA components generally improves PSNR values, while decreasing components may reduce CR, MSE, and other error metrics. The simulation results are analyzed and compared to other existing lossless audio encoding schemes with various statistical and robustness features.