{"title":"基于分形音频编码和误差补偿的高合成音频压缩模型","authors":"A. Ali, Loay E. George","doi":"10.33166/aetic.2022.02.001","DOIUrl":null,"url":null,"abstract":"This study presented a model for improving audio files quality using fractal coding specifically when a high compression ratio is required. The proposed high synthetic audio compression model which can be called (HSACM) is based on conventional fractal coding and lifting wavelet transform. Various lifting wavelet transform families and levels are used and their effects on the reconstructed audio files are discussed as well. Audio files from GTZAN dataset and standard measurements for data compression are used in the evaluation of the proposed model. The results reveal that using block length 50 samples which is the worst case, PSNR is increased, on average, from 34.1 to 44.8 dB and from 34.1 to 40.5 dB using lifting wavelet transform with 3 and 2 levels, respectively. Thus, the PSNR is improved by 10 and 5 dB with slightly reducing the compression ratio by 6.2 and 12.5%, respectively. Moreover, it can be noticed that adopting lifting wavelet transform with basis Haar, db1, db4, db5, cdf1.1 and cdf2.2 provide higher audio quality while db6, db8, sym7 and sym8 give the worst audio quality. Furthermore, the performance of HSACM is compared with that of existing work to highlight its performance.","PeriodicalId":36440,"journal":{"name":"Annals of Emerging Technologies in Computing","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"High Synthetic Audio Compression Model Based on Fractal Audio Coding and Error-Compensation\",\"authors\":\"A. Ali, Loay E. George\",\"doi\":\"10.33166/aetic.2022.02.001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study presented a model for improving audio files quality using fractal coding specifically when a high compression ratio is required. The proposed high synthetic audio compression model which can be called (HSACM) is based on conventional fractal coding and lifting wavelet transform. Various lifting wavelet transform families and levels are used and their effects on the reconstructed audio files are discussed as well. Audio files from GTZAN dataset and standard measurements for data compression are used in the evaluation of the proposed model. The results reveal that using block length 50 samples which is the worst case, PSNR is increased, on average, from 34.1 to 44.8 dB and from 34.1 to 40.5 dB using lifting wavelet transform with 3 and 2 levels, respectively. Thus, the PSNR is improved by 10 and 5 dB with slightly reducing the compression ratio by 6.2 and 12.5%, respectively. Moreover, it can be noticed that adopting lifting wavelet transform with basis Haar, db1, db4, db5, cdf1.1 and cdf2.2 provide higher audio quality while db6, db8, sym7 and sym8 give the worst audio quality. Furthermore, the performance of HSACM is compared with that of existing work to highlight its performance.\",\"PeriodicalId\":36440,\"journal\":{\"name\":\"Annals of Emerging Technologies in Computing\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Annals of Emerging Technologies in Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.33166/aetic.2022.02.001\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Emerging Technologies in Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.33166/aetic.2022.02.001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Computer Science","Score":null,"Total":0}
High Synthetic Audio Compression Model Based on Fractal Audio Coding and Error-Compensation
This study presented a model for improving audio files quality using fractal coding specifically when a high compression ratio is required. The proposed high synthetic audio compression model which can be called (HSACM) is based on conventional fractal coding and lifting wavelet transform. Various lifting wavelet transform families and levels are used and their effects on the reconstructed audio files are discussed as well. Audio files from GTZAN dataset and standard measurements for data compression are used in the evaluation of the proposed model. The results reveal that using block length 50 samples which is the worst case, PSNR is increased, on average, from 34.1 to 44.8 dB and from 34.1 to 40.5 dB using lifting wavelet transform with 3 and 2 levels, respectively. Thus, the PSNR is improved by 10 and 5 dB with slightly reducing the compression ratio by 6.2 and 12.5%, respectively. Moreover, it can be noticed that adopting lifting wavelet transform with basis Haar, db1, db4, db5, cdf1.1 and cdf2.2 provide higher audio quality while db6, db8, sym7 and sym8 give the worst audio quality. Furthermore, the performance of HSACM is compared with that of existing work to highlight its performance.