Underwater Scene Segmentation by Deep Neural Network

UK-RAS19 Conference: "Embedded Intelligence: Enabling and Supporting RAS Technologies" Proceedings Pub Date : 1900-01-01 DOI:10.31256/UKRAS19.12

Yang Zhou, Jiangtao Wang, Baihua Li, Q. Meng, Emanuele Rocco, Andrea Saiani

引用次数: 6

Abstract

A deep neural network architecture is proposed in this paper for underwater scene semantic segmentation. The architecture consists of encoder and decoder networks. Pretrained VGG-16 network is used as a feature extractor, while the decoder learns to expand the lower resolution feature maps. The network applies max un-pooling operator to avoid large number of learnable parameters, and, in order to make use of the feature maps in encoder network, it concatenates the feature maps with decoder and encoder for lower resolution feature maps. Our architecture shows capabilities of faster convergence and better accuracy. To get a clear view of underwater scene, an underwater enhancement neural network architecture is described in this paper and applied for training. It speeds up the training process and convergence rate in training.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于深度神经网络的水下场景分割

本文提出了一种用于水下场景语义分割的深度神经网络结构。该架构由编码器和解码器网络组成。使用预训练的VGG-16网络作为特征提取器，而解码器学习扩展低分辨率特征映射。该网络采用最大解池算子，避免了大量可学习参数的产生，并且为了充分利用编码器网络中的特征映射，将特征映射与解码器和编码器连接起来，以获得较低分辨率的特征映射。我们的架构显示出更快的收敛和更好的精度。为了获得清晰的水下场景视图，本文描述了一种水下增强神经网络架构，并将其应用于训练。它加快了训练过程和训练收敛速度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

UK-RAS19 Conference: "Embedded Intelligence: Enabling and Supporting RAS Technologies" Proceedings

自引率

0.00%

发文量