Content-based image retrieval for industrial material images with deep learning and encoded physical properties

Myung Seok Shim, Christopher Thiele, Jeremy Vila, Nishank Saxena, Detlef Hohl
{"title":"Content-based image retrieval for industrial material images with deep learning and encoded physical properties","authors":"Myung Seok Shim, Christopher Thiele, Jeremy Vila, Nishank Saxena, Detlef Hohl","doi":"10.1017/dce.2023.16","DOIUrl":null,"url":null,"abstract":"Abstract Industrial materials images are an important application domain for content-based image retrieval. Users need to quickly search databases for images that exhibit similar appearance, properties, and/or features to reduce analysis turnaround time and cost. The images in this study are 2D images of millimeter-scale rock samples acquired at micrometer resolution with light microscopy or extracted from 3D micro-CT scans. Labeled rock images are expensive and time-consuming to acquire and thus are typically only available in the tens of thousands. Training a high-capacity deep learning (DL) model from scratch is therefore not practicable due to data paucity. To overcome this “few-shot learning” challenge, we propose leveraging pretrained common DL models in conjunction with transfer learning. The “similarity” of industrial materials images is subjective and assessed by human experts based on both visual appearance and physical qualities. We have emulated this human-driven assessment process via a physics-informed neural network including metadata and physical measurements in the loss function. We present a novel DL architecture that combines Siamese neural networks with a loss function that integrates classification and regression terms. The networks are trained with both image and metadata similarity (classification), and with metadata prediction (regression). For efficient inference, we use a highly compressed image feature representation, computed offline once, to search the database for images similar to a query image. Numerical experiments demonstrate superior retrieval performance of our new architecture compared with other DL and custom-feature-based approaches.","PeriodicalId":158708,"journal":{"name":"Data-Centric Engineering","volume":"281 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data-Centric Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1017/dce.2023.16","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract Industrial materials images are an important application domain for content-based image retrieval. Users need to quickly search databases for images that exhibit similar appearance, properties, and/or features to reduce analysis turnaround time and cost. The images in this study are 2D images of millimeter-scale rock samples acquired at micrometer resolution with light microscopy or extracted from 3D micro-CT scans. Labeled rock images are expensive and time-consuming to acquire and thus are typically only available in the tens of thousands. Training a high-capacity deep learning (DL) model from scratch is therefore not practicable due to data paucity. To overcome this “few-shot learning” challenge, we propose leveraging pretrained common DL models in conjunction with transfer learning. The “similarity” of industrial materials images is subjective and assessed by human experts based on both visual appearance and physical qualities. We have emulated this human-driven assessment process via a physics-informed neural network including metadata and physical measurements in the loss function. We present a novel DL architecture that combines Siamese neural networks with a loss function that integrates classification and regression terms. The networks are trained with both image and metadata similarity (classification), and with metadata prediction (regression). For efficient inference, we use a highly compressed image feature representation, computed offline once, to search the database for images similar to a query image. Numerical experiments demonstrate superior retrieval performance of our new architecture compared with other DL and custom-feature-based approaches.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于内容的工业材料图像检索,具有深度学习和编码物理特性
摘要工业材料图像是基于内容的图像检索的一个重要应用领域。用户需要在数据库中快速搜索显示相似外观、属性和/或特征的图像,以减少分析周转时间和成本。本研究中的图像是用光学显微镜以微米分辨率获得的毫米级岩石样品的二维图像,或者是从三维微ct扫描中提取的图像。有标记的岩石图像既昂贵又耗时,因此通常只有成千上万的可用。因此,由于数据缺乏,从头开始训练高容量深度学习(DL)模型是不可行的。为了克服这种“少量学习”的挑战,我们建议利用预训练的通用深度学习模型与迁移学习相结合。工业材料图像的“相似性”是主观的,由人类专家根据视觉外观和物理质量进行评估。我们通过物理信息神经网络模拟了这种人为驱动的评估过程,包括损失函数中的元数据和物理测量。我们提出了一种新的深度学习架构,它将暹罗神经网络与集成分类和回归项的损失函数相结合。使用图像和元数据相似性(分类)以及元数据预测(回归)对网络进行训练。为了高效推理,我们使用高度压缩的图像特征表示,离线计算一次,在数据库中搜索与查询图像相似的图像。数值实验表明,与其他基于深度学习和自定义特征的方法相比,我们的新架构具有更好的检索性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Advancing digital healthcare engineering for aging ships and offshore structures: an in-depth review and feasibility analysis Physics-informed artificial intelligence models for the seismic response prediction of rocking structures dCNN/dCAM: anomaly precursors discovery in multivariate time series with deep convolutional neural networks Shaping the future of tunneling with data and emerging technologies Parametrized polyconvex hyperelasticity with physics-augmented neural networks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1