RAM: Interpreting real-world image super-resolution in the industry environment

IF 3.3 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pattern Recognition Letters Pub Date : 2025-04-03 DOI:10.1016/j.patrec.2025.03.034
Ze-Yu Mi, Yu-Bin Yang
{"title":"RAM: Interpreting real-world image super-resolution in the industry environment","authors":"Ze-Yu Mi,&nbsp;Yu-Bin Yang","doi":"10.1016/j.patrec.2025.03.034","DOIUrl":null,"url":null,"abstract":"<div><div>Industrial image super-resolution (SR) plays a crucial role in various industrial applications by generating high-resolution images that enhance image quality, clarity, and texture. The interpretability of industrial SR models is becoming increasingly important, enabling designers and quality inspectors to perform detailed image analysis and make more informed decisions. However, existing interpretability methods struggle to adapt to the complex degradation and diverse image patterns in industrial SR, making it challenging to provide reliable and accurate interpretations. To address this challenge, we propose a novel approach, Real Attribution Maps (RAM), designed for precise interpretation of industrial SR. RAM introduces two key components: the multi-path downsampling (MPD) function and the multi-progressive degradation (MPG) function. The MPD generates multiple attribution paths by applying a range of downsampling strategies, while the MPG incorporates random degradation kernels to better simulate real-world conditions, ensuring more accurate feature attribution. The final attribution map is derived by averaging the results from all paths. Extensive experiments conducted on industrial datasets, including IndSR, Wafer Maps, and Pelvis, validate the effectiveness of RAM. Our results show substantial improvements in several interpretation evaluation metrics and enhanced visual explanations that eliminate irrelevant interference. This work provides a powerful and versatile tool for explaining industrial SR models, offering significant advances in the interpretability of complex industrial images.</div></div>","PeriodicalId":54638,"journal":{"name":"Pattern Recognition Letters","volume":"192 ","pages":"Pages 86-92"},"PeriodicalIF":3.3000,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition Letters","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167865525001266","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Industrial image super-resolution (SR) plays a crucial role in various industrial applications by generating high-resolution images that enhance image quality, clarity, and texture. The interpretability of industrial SR models is becoming increasingly important, enabling designers and quality inspectors to perform detailed image analysis and make more informed decisions. However, existing interpretability methods struggle to adapt to the complex degradation and diverse image patterns in industrial SR, making it challenging to provide reliable and accurate interpretations. To address this challenge, we propose a novel approach, Real Attribution Maps (RAM), designed for precise interpretation of industrial SR. RAM introduces two key components: the multi-path downsampling (MPD) function and the multi-progressive degradation (MPG) function. The MPD generates multiple attribution paths by applying a range of downsampling strategies, while the MPG incorporates random degradation kernels to better simulate real-world conditions, ensuring more accurate feature attribution. The final attribution map is derived by averaging the results from all paths. Extensive experiments conducted on industrial datasets, including IndSR, Wafer Maps, and Pelvis, validate the effectiveness of RAM. Our results show substantial improvements in several interpretation evaluation metrics and enhanced visual explanations that eliminate irrelevant interference. This work provides a powerful and versatile tool for explaining industrial SR models, offering significant advances in the interpretability of complex industrial images.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
RAM:在工业环境中解读真实世界的图像超分辨率
工业图像超分辨率(SR)通过生成高分辨率图像来增强图像质量、清晰度和纹理,在各种工业应用中起着至关重要的作用。工业SR模型的可解释性变得越来越重要,使设计师和质量检查员能够执行详细的图像分析并做出更明智的决策。然而,现有的可解释性方法难以适应工业SR中复杂的退化和多样化的图像模式,因此难以提供可靠和准确的解释。为了应对这一挑战,我们提出了一种新颖的方法,真实属性图(RAM),旨在精确解释工业sr。RAM引入了两个关键组件:多路径下采样(MPD)功能和多渐进退化(MPG)功能。MPD通过应用一系列降采样策略生成多个属性路径,而MPG采用随机退化核来更好地模拟现实条件,确保更准确的特征属性。最终的归因图是通过对所有路径的结果进行平均而得到的。在工业数据集(包括IndSR、Wafer Maps和盆骨)上进行的大量实验验证了RAM的有效性。我们的研究结果显示,在几个口译评估指标和增强的视觉解释方面有了实质性的改进,消除了不相关的干扰。这项工作为解释工业SR模型提供了一个强大而通用的工具,在复杂工业图像的可解释性方面取得了重大进展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Pattern Recognition Letters
Pattern Recognition Letters 工程技术-计算机:人工智能
CiteScore
12.40
自引率
5.90%
发文量
287
审稿时长
9.1 months
期刊介绍: Pattern Recognition Letters aims at rapid publication of concise articles of a broad interest in pattern recognition. Subject areas include all the current fields of interest represented by the Technical Committees of the International Association of Pattern Recognition, and other developing themes involving learning and recognition.
期刊最新文献
Editorial Board Generalization performance distributions along learning curves Hierarchical memory-enhanced networks for student knowledge tracing Frequency-selective countnet: Enhancing text-guided object counting with frequency features PE-ViT: Parameter-efficient vision transformer with dimension-adaptive experts and economical attention
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1