{"title":"矢量量化水下图像增强与变压器","authors":"Xueyan Ding;Yixin Sui;Jianxin Zhang","doi":"10.1109/JOE.2024.3458348","DOIUrl":null,"url":null,"abstract":"Due to the complexity of underwater imaging environments, underwater images often suffer from blurriness, low contrast and color distortion, presenting a great challenge for underwater tasks. In this article, we propose a vector quantized underwater image enhancement network, which takes full advantage of generative adversarial networks and transformers through quantization. The proposed method consists of two parts: a vector quantized generative network and an axial flow-guided latent transformer. The vector quantized generative network first learns discrete content representations of underwater images through a vector quantized codebook. To facilitate deep feature extraction, an enhanced residual attention module that exploits the strengths of residual connection and channel-wise attention is introduced. After representing the content representation using codebook-indices, we use the axial flow-guided latent transformer to learn the content distribution in an autoregressive manner. The collaboration of generative adversarial networks and transformers assists in capturing both local and global dependencies in underwater images. Experimental results on publicly available data sets comprehensively validate the remarkable performance of the proposed method in underwater image enhancement tasks.","PeriodicalId":13191,"journal":{"name":"IEEE Journal of Oceanic Engineering","volume":"50 1","pages":"136-149"},"PeriodicalIF":3.8000,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Vector Quantized Underwater Image Enhancement With Transformers\",\"authors\":\"Xueyan Ding;Yixin Sui;Jianxin Zhang\",\"doi\":\"10.1109/JOE.2024.3458348\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Due to the complexity of underwater imaging environments, underwater images often suffer from blurriness, low contrast and color distortion, presenting a great challenge for underwater tasks. In this article, we propose a vector quantized underwater image enhancement network, which takes full advantage of generative adversarial networks and transformers through quantization. The proposed method consists of two parts: a vector quantized generative network and an axial flow-guided latent transformer. The vector quantized generative network first learns discrete content representations of underwater images through a vector quantized codebook. To facilitate deep feature extraction, an enhanced residual attention module that exploits the strengths of residual connection and channel-wise attention is introduced. After representing the content representation using codebook-indices, we use the axial flow-guided latent transformer to learn the content distribution in an autoregressive manner. The collaboration of generative adversarial networks and transformers assists in capturing both local and global dependencies in underwater images. Experimental results on publicly available data sets comprehensively validate the remarkable performance of the proposed method in underwater image enhancement tasks.\",\"PeriodicalId\":13191,\"journal\":{\"name\":\"IEEE Journal of Oceanic Engineering\",\"volume\":\"50 1\",\"pages\":\"136-149\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2024-11-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Journal of Oceanic Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10747805/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Oceanic Engineering","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10747805/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
Vector Quantized Underwater Image Enhancement With Transformers
Due to the complexity of underwater imaging environments, underwater images often suffer from blurriness, low contrast and color distortion, presenting a great challenge for underwater tasks. In this article, we propose a vector quantized underwater image enhancement network, which takes full advantage of generative adversarial networks and transformers through quantization. The proposed method consists of two parts: a vector quantized generative network and an axial flow-guided latent transformer. The vector quantized generative network first learns discrete content representations of underwater images through a vector quantized codebook. To facilitate deep feature extraction, an enhanced residual attention module that exploits the strengths of residual connection and channel-wise attention is introduced. After representing the content representation using codebook-indices, we use the axial flow-guided latent transformer to learn the content distribution in an autoregressive manner. The collaboration of generative adversarial networks and transformers assists in capturing both local and global dependencies in underwater images. Experimental results on publicly available data sets comprehensively validate the remarkable performance of the proposed method in underwater image enhancement tasks.
期刊介绍:
The IEEE Journal of Oceanic Engineering (ISSN 0364-9059) is the online-only quarterly publication of the IEEE Oceanic Engineering Society (IEEE OES). The scope of the Journal is the field of interest of the IEEE OES, which encompasses all aspects of science, engineering, and technology that address research, development, and operations pertaining to all bodies of water. This includes the creation of new capabilities and technologies from concept design through prototypes, testing, and operational systems to sense, explore, understand, develop, use, and responsibly manage natural resources.