基于改进DeepLab v3+的水下图像语义分割算法

Yongqi Yuan, Yubo Tian
{"title":"基于改进DeepLab v3+的水下图像语义分割算法","authors":"Yongqi Yuan, Yubo Tian","doi":"10.1117/12.2674688","DOIUrl":null,"url":null,"abstract":"Underwater images are easily affected by external factors such as light and water impurities, resulting in low segmentation accuracy of underwater images. Aiming to solve the problem, this paper proposes a semantic segmentation method of underwater images by introducing a new branch to improve the structure of the DeepLab v3+ model. It exploits the low-semantic features of the input image to improve the performance of the network. In the new branch, down-sampling is used to adjust the resolution of the input image, and the dimension of the feature map is adjusted by 1×1 convolution. The attention mechanism is used to focus on the channel of the feature map, and finally, the feature map of the existing branch is merged. In the experiments on the SUIM underwater dataset, the average intersection ratio of the model is 72.88%, and the average pixel accuracy is 84.03%. In the experiments on the PASCAL VOC dataset, the average intersection ratio of the model is 75.85%, and the average pixel accuracy is 84.5%. Compared with existing mainstream algorithms, the proposed algorithm achieves better results.","PeriodicalId":286364,"journal":{"name":"Conference on Computer Graphics, Artificial Intelligence, and Data Processing","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Semantic segmentation algorithm of underwater image based on improved DeepLab v3+\",\"authors\":\"Yongqi Yuan, Yubo Tian\",\"doi\":\"10.1117/12.2674688\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Underwater images are easily affected by external factors such as light and water impurities, resulting in low segmentation accuracy of underwater images. Aiming to solve the problem, this paper proposes a semantic segmentation method of underwater images by introducing a new branch to improve the structure of the DeepLab v3+ model. It exploits the low-semantic features of the input image to improve the performance of the network. In the new branch, down-sampling is used to adjust the resolution of the input image, and the dimension of the feature map is adjusted by 1×1 convolution. The attention mechanism is used to focus on the channel of the feature map, and finally, the feature map of the existing branch is merged. In the experiments on the SUIM underwater dataset, the average intersection ratio of the model is 72.88%, and the average pixel accuracy is 84.03%. In the experiments on the PASCAL VOC dataset, the average intersection ratio of the model is 75.85%, and the average pixel accuracy is 84.5%. Compared with existing mainstream algorithms, the proposed algorithm achieves better results.\",\"PeriodicalId\":286364,\"journal\":{\"name\":\"Conference on Computer Graphics, Artificial Intelligence, and Data Processing\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Conference on Computer Graphics, Artificial Intelligence, and Data Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2674688\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conference on Computer Graphics, Artificial Intelligence, and Data Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2674688","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

水下图像容易受到光线、水杂质等外界因素的影响,导致水下图像分割精度较低。针对这一问题,本文提出了一种水下图像的语义分割方法,通过引入一个新的分支来改进DeepLab v3+模型的结构。它利用输入图像的低语义特征来提高网络的性能。在新分支中,采用降采样调整输入图像的分辨率,并通过1×1卷积调整特征图的尺寸。利用注意机制对特征图的通道进行关注,最后对已有分支的特征图进行合并。在SUIM水下数据集的实验中,模型的平均相交率为72.88%,平均像元精度为84.03%。在PASCAL VOC数据集上的实验中,该模型的平均相交率为75.85%,平均像素精度为84.5%。与现有主流算法相比,本文算法取得了更好的效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Semantic segmentation algorithm of underwater image based on improved DeepLab v3+
Underwater images are easily affected by external factors such as light and water impurities, resulting in low segmentation accuracy of underwater images. Aiming to solve the problem, this paper proposes a semantic segmentation method of underwater images by introducing a new branch to improve the structure of the DeepLab v3+ model. It exploits the low-semantic features of the input image to improve the performance of the network. In the new branch, down-sampling is used to adjust the resolution of the input image, and the dimension of the feature map is adjusted by 1×1 convolution. The attention mechanism is used to focus on the channel of the feature map, and finally, the feature map of the existing branch is merged. In the experiments on the SUIM underwater dataset, the average intersection ratio of the model is 72.88%, and the average pixel accuracy is 84.03%. In the experiments on the PASCAL VOC dataset, the average intersection ratio of the model is 75.85%, and the average pixel accuracy is 84.5%. Compared with existing mainstream algorithms, the proposed algorithm achieves better results.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Size and defect detection of valve based on computer vision Research on quantitative evaluation method of test flight risk based on fuzzy theory Research on target grid investment optimization technology of medium- and low-voltage distribution network based on improved genetic algorithm Research on the analysis method of civil aircraft operational safety data Research on plum target detection based on improved YOLOv3 and jetson nano
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1