{"title":"Deep Learning Image Segmentation Based on Adaptive Total Variation Preprocessing","authors":"Guodong Wang;Yumei Ma;Zhenkuan Pan;Xuqun Zhang","doi":"10.1109/TCYB.2024.3418937","DOIUrl":null,"url":null,"abstract":"This article proposes a two-stage image segmentation method based on the MS model, aiming to enhance the segmentation accuracy of images with complex structure and background. In the first stage, in order to obtain the smooth approximate solution of the image by minimizing the energy functional, an anisotropic regularization term formed by the combination of the gradient operator and an adaptive weighted matrix is introduced. Different weights in both horizontal and vertical directions can be provided by the adaptive weighting matrix according to the gradient information, so that the curve diffuses along the directions of local feature tangents of the objects. In addition, information irrelevant to the image target can be filtered out by the adaptive weighting matrix, thus reducing the interference of complex background. The alternating direction method of multipliers (ADMMs) is employed to solve the convex optimization problem in the first stage. In the second stage, the smoothed image obtained in the first stage is segmented by the deep learning method. By comparing with some traditional methods and deep learning methods, the results demonstrate that not only has good perceptual quality been achieved by this segmentation method, but also superior evaluation metrics have been obtained.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"54 12","pages":"7991-7998"},"PeriodicalIF":10.5000,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Cybernetics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10716871/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
This article proposes a two-stage image segmentation method based on the MS model, aiming to enhance the segmentation accuracy of images with complex structure and background. In the first stage, in order to obtain the smooth approximate solution of the image by minimizing the energy functional, an anisotropic regularization term formed by the combination of the gradient operator and an adaptive weighted matrix is introduced. Different weights in both horizontal and vertical directions can be provided by the adaptive weighting matrix according to the gradient information, so that the curve diffuses along the directions of local feature tangents of the objects. In addition, information irrelevant to the image target can be filtered out by the adaptive weighting matrix, thus reducing the interference of complex background. The alternating direction method of multipliers (ADMMs) is employed to solve the convex optimization problem in the first stage. In the second stage, the smoothed image obtained in the first stage is segmented by the deep learning method. By comparing with some traditional methods and deep learning methods, the results demonstrate that not only has good perceptual quality been achieved by this segmentation method, but also superior evaluation metrics have been obtained.
本文提出了一种基于 MS 模型的两阶段图像分割方法,旨在提高具有复杂结构和背景的图像的分割精度。在第一阶段,为了通过最小化能量函数获得图像的平滑近似解,引入了由梯度算子和自适应加权矩阵组合而成的各向异性正则化项。自适应加权矩阵可根据梯度信息在水平和垂直方向上提供不同的权重,从而使曲线沿着物体局部特征切线的方向扩散。此外,自适应加权矩阵还能过滤掉与图像目标无关的信息,从而减少复杂背景的干扰。第一阶段采用交替方向乘法(ADMMs)解决凸优化问题。在第二阶段,利用深度学习方法对第一阶段得到的平滑图像进行分割。通过与一些传统方法和深度学习方法的比较,结果表明这种分割方法不仅获得了良好的感知质量,而且获得了更优越的评价指标。
期刊介绍:
The scope of the IEEE Transactions on Cybernetics includes computational approaches to the field of cybernetics. Specifically, the transactions welcomes papers on communication and control across machines or machine, human, and organizations. The scope includes such areas as computational intelligence, computer vision, neural networks, genetic algorithms, machine learning, fuzzy systems, cognitive systems, decision making, and robotics, to the extent that they contribute to the theme of cybernetics or demonstrate an application of cybernetics principles.