Fringe projection three-dimensional (3D) measurement is an important non-contact measurement technique, and the precision of 3D reconstruction largely depends on the accuracy of phase retrieval. With the development of deep learning (DL), phase retrieval based on deep neural network (DNN) has been widely studied. The performance of neural networks plays a decisive role in determining the accuracy of phase demodulation in fringe projection profilometry (FPP). Currently, most deep learning-based wrapped phase extraction methods are built upon the U-Net architecture. Nevertheless, the hierarchical skip-connection mechanism of U-Net presents inherent limitations in global information transmission and feature fusion, which in turn restricts further improvements in phase retrieval accuracy. For this purpose, we propose a method for phase demodulation grounded in a novel multi-scale feature fusion network, referred to as SE-SwinUNet. The network combines the advantages of the Swin Transformer and residual connections, incorporating an asymmetric design in both the encoder and decoder. Through enhanced global information modeling and local detail refinement, it markedly improves the efficiency of feature propagation and utilization. Furthermore, by incorporating a channel attention mechanism (Squeeze-and-Excitation layer, SE), the network is capable of adaptively allocating appropriate weights to multi-scale features, thereby effectively reinforcing its focus on the most salient features. Experimental results demonstrate that SE-SwinUNet achieves higher accuracy in phase demodulation tasks compared to the conventional U-Net, exhibiting particularly pronounced advantages in complex scenarios.
扫码关注我们
求助内容:
应助结果提醒方式:
