Jingyu Song , Haiyong Xu , Gangyi Jiang , Mei Yu , Yeyao Chen , Ting Luo , Yang Song
{"title":"基于频域的潜在扩散模型用于水下图像增强","authors":"Jingyu Song , Haiyong Xu , Gangyi Jiang , Mei Yu , Yeyao Chen , Ting Luo , Yang Song","doi":"10.1016/j.patcog.2024.111198","DOIUrl":null,"url":null,"abstract":"<div><div>The degradation of underwater images, due to complex factors, negatively impacts the performance of underwater visual tasks. However, most underwater image enhancement methods (UIE) have been confined to the spatial domain, disregarding the frequency domain. This limitation hampers the full exploitation of the model’s learning and representational capabilities. To address this, a two-stage frequency domain-based latent diffusion model (FD-LDM) is introduced for UIE. Firstly, the model employs a lightweight parameter estimation network (L-PEN) to estimate the degradation parameters of underwater images, thereby mitigating the impact of color bias on the diffusion model. Subsequently, considering the varying degrees of recovery between high and low-frequency images, high and low-frequency priors are extracted in the second stage and integrated with the refined latent diffusion model to enhance the images further. Extensive experiments have confirmed the method’s effectiveness and robustness, particularly under color bias scenes.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"160 ","pages":"Article 111198"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Frequency domain-based latent diffusion model for underwater image enhancement\",\"authors\":\"Jingyu Song , Haiyong Xu , Gangyi Jiang , Mei Yu , Yeyao Chen , Ting Luo , Yang Song\",\"doi\":\"10.1016/j.patcog.2024.111198\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The degradation of underwater images, due to complex factors, negatively impacts the performance of underwater visual tasks. However, most underwater image enhancement methods (UIE) have been confined to the spatial domain, disregarding the frequency domain. This limitation hampers the full exploitation of the model’s learning and representational capabilities. To address this, a two-stage frequency domain-based latent diffusion model (FD-LDM) is introduced for UIE. Firstly, the model employs a lightweight parameter estimation network (L-PEN) to estimate the degradation parameters of underwater images, thereby mitigating the impact of color bias on the diffusion model. Subsequently, considering the varying degrees of recovery between high and low-frequency images, high and low-frequency priors are extracted in the second stage and integrated with the refined latent diffusion model to enhance the images further. Extensive experiments have confirmed the method’s effectiveness and robustness, particularly under color bias scenes.</div></div>\",\"PeriodicalId\":49713,\"journal\":{\"name\":\"Pattern Recognition\",\"volume\":\"160 \",\"pages\":\"Article 111198\"},\"PeriodicalIF\":7.5000,\"publicationDate\":\"2024-11-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S003132032400949X\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S003132032400949X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Frequency domain-based latent diffusion model for underwater image enhancement
The degradation of underwater images, due to complex factors, negatively impacts the performance of underwater visual tasks. However, most underwater image enhancement methods (UIE) have been confined to the spatial domain, disregarding the frequency domain. This limitation hampers the full exploitation of the model’s learning and representational capabilities. To address this, a two-stage frequency domain-based latent diffusion model (FD-LDM) is introduced for UIE. Firstly, the model employs a lightweight parameter estimation network (L-PEN) to estimate the degradation parameters of underwater images, thereby mitigating the impact of color bias on the diffusion model. Subsequently, considering the varying degrees of recovery between high and low-frequency images, high and low-frequency priors are extracted in the second stage and integrated with the refined latent diffusion model to enhance the images further. Extensive experiments have confirmed the method’s effectiveness and robustness, particularly under color bias scenes.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.