{"title":"通过像素差增强技术实现遥感图像平移锐化","authors":"","doi":"10.1016/j.jag.2024.104045","DOIUrl":null,"url":null,"abstract":"<div><p>Nowadays, embedding-based pan-sharpening networks aimed at fusing panchromatic (PAN) and multispectral (MS) images are abundant, yet their results still show spectral distortion and spatial fuzziness. In this paper, we design a multi-scale fusion structure to minimize the gap between the pan-sharpened image and the reference image progressively. Specifically, we proposed a method based on the scale difference between PAN and MS images, using a convolutional neural network embedding pixel difference enhanced module (PDEM) to obtain the pan-sharpened image and minimizing the losses from each scale. The network includes three scales, each scale contains the PDEM to generate the intermediate results until to the last scale which obtains the final pan-sharpened result. The designed PDEM extracts deep features from PAN and MS images, using different kernel sizes and receptive field scales to diversify the extracted information. Three-direction pixel difference convolutions (PDCs) are utilized to maintain and enhance the edge details of spatial information. The loss function is to sum up the mean square error and mean absolute error between the pan-sharpened image and the reference image at three scales. Extensive experiments suggest the proposed method outperforms state-of-the-art methods from visual and quantitative perspectives, and confirm the effectiveness of PDEM in extracting and retaining image information and edge enhancement. The high-level vision task experiments also show our method has good practical value for further applications.</p></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":null,"pages":null},"PeriodicalIF":7.6000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1569843224003996/pdfft?md5=541c6ddba470fe05a26ccfed57a8a4c9&pid=1-s2.0-S1569843224003996-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Remote sensing image pan-sharpening via Pixel difference enhance\",\"authors\":\"\",\"doi\":\"10.1016/j.jag.2024.104045\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Nowadays, embedding-based pan-sharpening networks aimed at fusing panchromatic (PAN) and multispectral (MS) images are abundant, yet their results still show spectral distortion and spatial fuzziness. In this paper, we design a multi-scale fusion structure to minimize the gap between the pan-sharpened image and the reference image progressively. Specifically, we proposed a method based on the scale difference between PAN and MS images, using a convolutional neural network embedding pixel difference enhanced module (PDEM) to obtain the pan-sharpened image and minimizing the losses from each scale. The network includes three scales, each scale contains the PDEM to generate the intermediate results until to the last scale which obtains the final pan-sharpened result. The designed PDEM extracts deep features from PAN and MS images, using different kernel sizes and receptive field scales to diversify the extracted information. Three-direction pixel difference convolutions (PDCs) are utilized to maintain and enhance the edge details of spatial information. The loss function is to sum up the mean square error and mean absolute error between the pan-sharpened image and the reference image at three scales. Extensive experiments suggest the proposed method outperforms state-of-the-art methods from visual and quantitative perspectives, and confirm the effectiveness of PDEM in extracting and retaining image information and edge enhancement. The high-level vision task experiments also show our method has good practical value for further applications.</p></div>\",\"PeriodicalId\":73423,\"journal\":{\"name\":\"International journal of applied earth observation and geoinformation : ITC journal\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2024-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S1569843224003996/pdfft?md5=541c6ddba470fe05a26ccfed57a8a4c9&pid=1-s2.0-S1569843224003996-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International journal of applied earth observation and geoinformation : ITC journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1569843224003996\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"REMOTE SENSING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of applied earth observation and geoinformation : ITC journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1569843224003996","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"REMOTE SENSING","Score":null,"Total":0}
引用次数: 0
摘要
目前,基于嵌入的全色(PAN)和多光谱(MS)图像融合的全色锐化网络非常丰富,但其结果仍然存在光谱失真和空间模糊的问题。在本文中,我们设计了一种多尺度融合结构,以逐步缩小全色锐化图像与参考图像之间的差距。具体来说,我们提出了一种基于 PAN 和 MS 图像尺度差的方法,使用嵌入像素差增强模块(PDEM)的卷积神经网络来获得平移锐化图像,并最大限度地减少每个尺度的损失。该网络包括三个尺度,每个尺度包含一个 PDEM,用于生成中间结果,直到最后一个尺度获得最终的平移锐化结果。设计的 PDEM 可从 PAN 和 MS 图像中提取深度特征,使用不同的核大小和感受野尺度来丰富提取的信息。利用三向像素差值卷积(PDC)来保持和增强空间信息的边缘细节。损失函数是平移锐化图像与参考图像在三个尺度上的均方误差和平均绝对误差之和。大量实验表明,从视觉和定量角度来看,所提出的方法优于最先进的方法,并证实了 PDEM 在提取和保留图像信息以及边缘增强方面的有效性。高级视觉任务实验也表明,我们的方法在进一步应用中具有良好的实用价值。
Remote sensing image pan-sharpening via Pixel difference enhance
Nowadays, embedding-based pan-sharpening networks aimed at fusing panchromatic (PAN) and multispectral (MS) images are abundant, yet their results still show spectral distortion and spatial fuzziness. In this paper, we design a multi-scale fusion structure to minimize the gap between the pan-sharpened image and the reference image progressively. Specifically, we proposed a method based on the scale difference between PAN and MS images, using a convolutional neural network embedding pixel difference enhanced module (PDEM) to obtain the pan-sharpened image and minimizing the losses from each scale. The network includes three scales, each scale contains the PDEM to generate the intermediate results until to the last scale which obtains the final pan-sharpened result. The designed PDEM extracts deep features from PAN and MS images, using different kernel sizes and receptive field scales to diversify the extracted information. Three-direction pixel difference convolutions (PDCs) are utilized to maintain and enhance the edge details of spatial information. The loss function is to sum up the mean square error and mean absolute error between the pan-sharpened image and the reference image at three scales. Extensive experiments suggest the proposed method outperforms state-of-the-art methods from visual and quantitative perspectives, and confirm the effectiveness of PDEM in extracting and retaining image information and edge enhancement. The high-level vision task experiments also show our method has good practical value for further applications.
期刊介绍:
The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.