{"title":"基于改进U-Net的复杂成像环境下光学和红外图像融合的深度学习网络。","authors":"Bing-Quan Xiang, Chao Pan, Jin Liu","doi":"10.1364/JOSAA.492002","DOIUrl":null,"url":null,"abstract":"<p><p>The fusion of optical and infrared images is a critical task in the field of image processing. However, it is challenging to achieve optimal results when fusing images from complex environments. In this paper, we propose a deep learning network model comprising an encoding network and a decoding network based on the modified U-Net network to fuse low-quality images from complex imaging environments. As both encoding and decoding networks use similar convolutional modules, they can share similar layer structures to improve the overall fusion performance. Furthermore, an attention mechanism module is integrated into the decoding network to identify and capture the crucial features of the fused images. It can assist the deep learning network to extract more relevant image features and thus get more accurate fusion. The proposed model has been compared with some existing methods to prove its performance in view of subjective and objective evaluations.</p>","PeriodicalId":17382,"journal":{"name":"Journal of The Optical Society of America A-optics Image Science and Vision","volume":"40 9","pages":"1644-1653"},"PeriodicalIF":1.4000,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Deep learning network for fusing optical and infrared images in a complex imaging environment by using the modified U-Net.\",\"authors\":\"Bing-Quan Xiang, Chao Pan, Jin Liu\",\"doi\":\"10.1364/JOSAA.492002\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The fusion of optical and infrared images is a critical task in the field of image processing. However, it is challenging to achieve optimal results when fusing images from complex environments. In this paper, we propose a deep learning network model comprising an encoding network and a decoding network based on the modified U-Net network to fuse low-quality images from complex imaging environments. As both encoding and decoding networks use similar convolutional modules, they can share similar layer structures to improve the overall fusion performance. Furthermore, an attention mechanism module is integrated into the decoding network to identify and capture the crucial features of the fused images. It can assist the deep learning network to extract more relevant image features and thus get more accurate fusion. The proposed model has been compared with some existing methods to prove its performance in view of subjective and objective evaluations.</p>\",\"PeriodicalId\":17382,\"journal\":{\"name\":\"Journal of The Optical Society of America A-optics Image Science and Vision\",\"volume\":\"40 9\",\"pages\":\"1644-1653\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2023-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of The Optical Society of America A-optics Image Science and Vision\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://doi.org/10.1364/JOSAA.492002\",\"RegionNum\":3,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"OPTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of The Optical Society of America A-optics Image Science and Vision","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1364/JOSAA.492002","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"OPTICS","Score":null,"Total":0}
Deep learning network for fusing optical and infrared images in a complex imaging environment by using the modified U-Net.
The fusion of optical and infrared images is a critical task in the field of image processing. However, it is challenging to achieve optimal results when fusing images from complex environments. In this paper, we propose a deep learning network model comprising an encoding network and a decoding network based on the modified U-Net network to fuse low-quality images from complex imaging environments. As both encoding and decoding networks use similar convolutional modules, they can share similar layer structures to improve the overall fusion performance. Furthermore, an attention mechanism module is integrated into the decoding network to identify and capture the crucial features of the fused images. It can assist the deep learning network to extract more relevant image features and thus get more accurate fusion. The proposed model has been compared with some existing methods to prove its performance in view of subjective and objective evaluations.
期刊介绍:
The Journal of the Optical Society of America A (JOSA A) is devoted to developments in any field of classical optics, image science, and vision. JOSA A includes original peer-reviewed papers on such topics as:
* Atmospheric optics
* Clinical vision
* Coherence and Statistical Optics
* Color
* Diffraction and gratings
* Image processing
* Machine vision
* Physiological optics
* Polarization
* Scattering
* Signal processing
* Thin films
* Visual optics
Also: j opt soc am a.