A Serial Perspective on Photometric Stereo of Filtering and Serializing Spatial Information

Minzhe Xu;Xin Ding;You Yang;Yinqiang Zheng;Qiong Liu
{"title":"A Serial Perspective on Photometric Stereo of Filtering and Serializing Spatial Information","authors":"Minzhe Xu;Xin Ding;You Yang;Yinqiang Zheng;Qiong Liu","doi":"10.1109/TVCG.2025.3546657","DOIUrl":null,"url":null,"abstract":"In this paper, we introduce a novel method of <italic>F</i>iltering and <italic>S</i>erializing <italic>S</i>patial <italic>I</i>nformation to tackle uncalibrated photometric stereo tasks, termed FSSI-PS. Photometric stereo aims to recover surface normals from images with varying lighting and is crucial for tasks like 3D reconstruction and defect detection. Current methods in complex surface reconstruction are costly and inaccurate due to redundant feature representations from GCN or Transformer modules, caused by the weak global information extraction capability of GCNs or the large computational cost of Transformers. Furthermore, the trainset’s lack of richness in texture complexity makes reconstruction more difficult. We address these issues by optimizing feature maps and dataset richness through serializing and filtering. First, we use Mamba-RNN to optimize feature representation by directly fusing feature maps, which reduces redundancy and uses minimal computational resources. Specifically, we treat input spatial information as a sequence and serialize it by sorting. Furthermore, we introduce the Mean Angular Variation metric to assess reconstruction difficulty by measuring texture complexity. It classifies PS-Sculpture and PS-Blobby into three categories: <italic>D</i>ifficult, <italic>N</i>ormal, and <italic>S</i>imple. We use this to construct DNS-S+B, a photometric stereo training set with rich complexity levels. Our method is compared with state-of-the-art methods on the DiLiGenT and LUCES benchmarks to highlight effectiveness.","PeriodicalId":94035,"journal":{"name":"IEEE transactions on visualization and computer graphics","volume":"31 10","pages":"7448-7462"},"PeriodicalIF":6.5000,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on visualization and computer graphics","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10907979/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In this paper, we introduce a novel method of Filtering and Serializing Spatial Information to tackle uncalibrated photometric stereo tasks, termed FSSI-PS. Photometric stereo aims to recover surface normals from images with varying lighting and is crucial for tasks like 3D reconstruction and defect detection. Current methods in complex surface reconstruction are costly and inaccurate due to redundant feature representations from GCN or Transformer modules, caused by the weak global information extraction capability of GCNs or the large computational cost of Transformers. Furthermore, the trainset’s lack of richness in texture complexity makes reconstruction more difficult. We address these issues by optimizing feature maps and dataset richness through serializing and filtering. First, we use Mamba-RNN to optimize feature representation by directly fusing feature maps, which reduces redundancy and uses minimal computational resources. Specifically, we treat input spatial information as a sequence and serialize it by sorting. Furthermore, we introduce the Mean Angular Variation metric to assess reconstruction difficulty by measuring texture complexity. It classifies PS-Sculpture and PS-Blobby into three categories: Difficult, Normal, and Simple. We use this to construct DNS-S+B, a photometric stereo training set with rich complexity levels. Our method is compared with state-of-the-art methods on the DiLiGenT and LUCES benchmarks to highlight effectiveness.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
空间信息滤波与序列化的光度立体透视。
在本文中,我们介绍了一种新的滤波和序列化空间信息的方法,以解决未校准的光度立体任务,称为FSSI-PS。光度立体旨在从不同光照的图像中恢复表面法线,对于3D重建和缺陷检测等任务至关重要。由于GCN或Transformer模块的全局信息提取能力较弱或Transformer的计算成本大,导致GCN或Transformer模块的特征表示冗余,导致当前复杂曲面重构方法成本高且不准确。此外,列车集的纹理复杂度不够丰富,使得重建更加困难。我们通过序列化和过滤优化特征映射和数据集丰富度来解决这些问题。首先,我们使用Mamba-RNN通过直接融合特征映射来优化特征表示,减少了冗余并使用了最小的计算资源。具体来说,我们将输入的空间信息视为一个序列,并通过排序对其进行序列化。此外,我们引入了平均角度变化度量,通过测量纹理复杂度来评估重建难度。它将PS-Sculpture和PS-Blobby分为三类:困难、正常和简单。我们利用它构建了具有丰富复杂度的光度立体训练集DNS-S+B。我们的方法与最先进的方法在勤奋和LUCES基准上进行比较,以突出有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
HYVE: Hybrid Vertex Encoder for Neural Distance Fields. Errata to "DiffCap: Diffusion-Based Real-Time Human Motion Capture Using Sparse IMUs and a Monocular Camera". PSAvatar: A Point-Based Shape Model for Real-Time Head Avatar Animation With 3D Gaussian Splatting. Detecting Stable Cross-Impact Patterns in Bivariate Time Series. BRep-GD: a Graph Diffusion Model for CAD Boundary Representation Generation.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1