Industrial surface defect detection is constrained by the scarcity of defective samples and by the insufficient capacity of current segmentation methods to precisely delineate defect boundaries. To address these challenges, we propose F2P-Net, a few-sample, highly precise industrial surface defect segmentation framework composed of three core modules. ViCNet (ViT and CNN collaborative encoder network) integrates a vision transformer backbone with an auxiliary convolutional branch to retain robust large-model priors while enhancing sensitivity to fine-scale textures and local irregularities. AFDec (automated geometric prompt and multi-scale feature fusion decoder) employs automated geometric prompts to localize potential defect regions and fuses hierarchical multi-scale features to improve boundary delineation and mask consistency. EVPT (edge-enhanced visual prompt tuning) is a fine-tuning module incorporating edge-explicit visual prompt to facilitate effective industrial domain adaptation of large vision models. The proposed method achieves considerable performance over existing full-data training approaches in metrics including mAP, Recall, and IoU using only 1.76 %∼3.06 % of training images across NEU_Seg, MT, KolektorSDD2, and DAGM2007 datasets. Under full-data training, it attains state-of-the-art segmentation accuracies with IoU scores of 86.03 %, 92.57 %, 78.77 %, and 82.55 %, respectively. The network provides a novel solution for industrial applications with few-sample, high-precision defect segmentation. Code is available at https://github.com/kerongYan/F2P-Net.
扫码关注我们
求助内容:
应助结果提醒方式:
