{"title":"基于改进LSTM的图像压缩算法研究","authors":"Shasha Li, Mengjun Li, Pengfei Li, Yongjun Li","doi":"10.1109/ISCEIC51027.2020.00031","DOIUrl":null,"url":null,"abstract":"With the advent of the era of big data, storing and transferring data is facing tremendous pressure. How to use deep learning to obtain higher compression ratio and higher quality images has become an urgent problem. Recurrent neural network (RNN) can control the bit rate of images with iterative manner to improve compression performance. However, RNN needs to introduce long short term memory (LSTM) to solve the problem of long-term dependence, which leads to the model more complex. In order to speed up training process and reconstruct higher-quality images, firstly, this paper improves the activation function in LSTM to better determine the information to be stored or forgotten, so that the amount of parameters is reduced and the training process is faster. Then, the image recovery block is introduced in the decoder to reconstruct high-resolution images. Finally, instead of L1 loss, we use SmoothL1 loss to accelerate the convergence of loss. Experimental results show that our model can achieve a higher compression ratio, and evaluated by SSIM the value is more nearly to 1.","PeriodicalId":249521,"journal":{"name":"2020 International Symposium on Computer Engineering and Intelligent Communications (ISCEIC)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Image compression algorithm research based on improved LSTM\",\"authors\":\"Shasha Li, Mengjun Li, Pengfei Li, Yongjun Li\",\"doi\":\"10.1109/ISCEIC51027.2020.00031\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the advent of the era of big data, storing and transferring data is facing tremendous pressure. How to use deep learning to obtain higher compression ratio and higher quality images has become an urgent problem. Recurrent neural network (RNN) can control the bit rate of images with iterative manner to improve compression performance. However, RNN needs to introduce long short term memory (LSTM) to solve the problem of long-term dependence, which leads to the model more complex. In order to speed up training process and reconstruct higher-quality images, firstly, this paper improves the activation function in LSTM to better determine the information to be stored or forgotten, so that the amount of parameters is reduced and the training process is faster. Then, the image recovery block is introduced in the decoder to reconstruct high-resolution images. Finally, instead of L1 loss, we use SmoothL1 loss to accelerate the convergence of loss. Experimental results show that our model can achieve a higher compression ratio, and evaluated by SSIM the value is more nearly to 1.\",\"PeriodicalId\":249521,\"journal\":{\"name\":\"2020 International Symposium on Computer Engineering and Intelligent Communications (ISCEIC)\",\"volume\":\"80 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Symposium on Computer Engineering and Intelligent Communications (ISCEIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCEIC51027.2020.00031\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Symposium on Computer Engineering and Intelligent Communications (ISCEIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCEIC51027.2020.00031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Image compression algorithm research based on improved LSTM
With the advent of the era of big data, storing and transferring data is facing tremendous pressure. How to use deep learning to obtain higher compression ratio and higher quality images has become an urgent problem. Recurrent neural network (RNN) can control the bit rate of images with iterative manner to improve compression performance. However, RNN needs to introduce long short term memory (LSTM) to solve the problem of long-term dependence, which leads to the model more complex. In order to speed up training process and reconstruct higher-quality images, firstly, this paper improves the activation function in LSTM to better determine the information to be stored or forgotten, so that the amount of parameters is reduced and the training process is faster. Then, the image recovery block is introduced in the decoder to reconstruct high-resolution images. Finally, instead of L1 loss, we use SmoothL1 loss to accelerate the convergence of loss. Experimental results show that our model can achieve a higher compression ratio, and evaluated by SSIM the value is more nearly to 1.