{"title":"Three-dimensional surface structure reconstruction of reflective objects using multi-stage deep learning","authors":"Wenguo Li, Yuyang Yan, Hongjun Lin, Zeqian Feng","doi":"10.1007/s10043-024-00940-1","DOIUrl":null,"url":null,"abstract":"<p>With the development of deep learning and structured light streak projection techniques in three-dimensional (3D) imaging, research on the direct reconstruction of 3D shapes from single-streak images has attracted much attention. However, accurately reconstructing 3D shapes is particularly challenging when dealing with objects with specular reflective surfaces. To address this problem, this paper proposes an innovative multi-stage deep learning method that combines the pix2pix adversarial network and a modified version of the DC-HNet architecture. The technique aims to accurately reconstruct 3D shapes from streaked images by eliminating highlights in specular reflection regions through a staged process first. Specifically, the pix2pix adversarial network is first used to eliminate highlights and generate streak images without specular reflections; subsequently, the improved DC-HNet network is further processed to accurately deduce the phase distribution information of the object from the streak images with the elimination of highlights, and then reconstruct the 3D shape. Compared with the traditional self-encoder-based convolutional neural network (CNN) model, the multi-stage approach proposed in this paper significantly improves the accuracy of 3D shape reconstruction by separating the two key steps of highlight elimination and phase derivation and combining them with multi-scale feature enhancement. In this paper, the method’s effectiveness is verified on experimental data, and the results show that the proposed method provides a significant improvement in 3D shape prediction accuracy compared with the existing U-Net network and MultiResUet network. These findings not only demonstrate the innovation and robustness of the proposed method but also show its potential in scientific research and engineering applications.</p>","PeriodicalId":722,"journal":{"name":"Optical Review","volume":"80 1 1","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2025-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optical Review","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1007/s10043-024-00940-1","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"OPTICS","Score":null,"Total":0}
引用次数: 0
Abstract
With the development of deep learning and structured light streak projection techniques in three-dimensional (3D) imaging, research on the direct reconstruction of 3D shapes from single-streak images has attracted much attention. However, accurately reconstructing 3D shapes is particularly challenging when dealing with objects with specular reflective surfaces. To address this problem, this paper proposes an innovative multi-stage deep learning method that combines the pix2pix adversarial network and a modified version of the DC-HNet architecture. The technique aims to accurately reconstruct 3D shapes from streaked images by eliminating highlights in specular reflection regions through a staged process first. Specifically, the pix2pix adversarial network is first used to eliminate highlights and generate streak images without specular reflections; subsequently, the improved DC-HNet network is further processed to accurately deduce the phase distribution information of the object from the streak images with the elimination of highlights, and then reconstruct the 3D shape. Compared with the traditional self-encoder-based convolutional neural network (CNN) model, the multi-stage approach proposed in this paper significantly improves the accuracy of 3D shape reconstruction by separating the two key steps of highlight elimination and phase derivation and combining them with multi-scale feature enhancement. In this paper, the method’s effectiveness is verified on experimental data, and the results show that the proposed method provides a significant improvement in 3D shape prediction accuracy compared with the existing U-Net network and MultiResUet network. These findings not only demonstrate the innovation and robustness of the proposed method but also show its potential in scientific research and engineering applications.
期刊介绍:
Optical Review is an international journal published by the Optical Society of Japan. The scope of the journal is:
General and physical optics;
Quantum optics and spectroscopy;
Information optics;
Photonics and optoelectronics;
Biomedical photonics and biological optics;
Lasers;
Nonlinear optics;
Optical systems and technologies;
Optical materials and manufacturing technologies;
Vision;
Infrared and short wavelength optics;
Cross-disciplinary areas such as environmental, energy, food, agriculture and space technologies;
Other optical methods and applications.