Yang Zhou, Jiangtao Wang, Baihua Li, Q. Meng, Emanuele Rocco, Andrea Saiani
{"title":"Underwater Scene Segmentation by Deep Neural Network","authors":"Yang Zhou, Jiangtao Wang, Baihua Li, Q. Meng, Emanuele Rocco, Andrea Saiani","doi":"10.31256/UKRAS19.12","DOIUrl":null,"url":null,"abstract":"A deep neural network architecture is proposed in\nthis paper for underwater scene semantic segmentation. The\narchitecture consists of encoder and decoder networks. Pretrained VGG-16 network is used as a feature extractor, while the\ndecoder learns to expand the lower resolution feature maps. The\nnetwork applies max un-pooling operator to avoid large number\nof learnable parameters, and, in order to make use of the feature\nmaps in encoder network, it concatenates the feature maps with\ndecoder and encoder for lower resolution feature maps. Our\narchitecture shows capabilities of faster convergence and better\naccuracy. To get a clear view of underwater scene, an underwater\nenhancement neural network architecture is described in this\npaper and applied for training. It speeds up the training process\nand convergence rate in training.","PeriodicalId":424229,"journal":{"name":"UK-RAS19 Conference: \"Embedded Intelligence: Enabling and Supporting RAS Technologies\" Proceedings","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"UK-RAS19 Conference: \"Embedded Intelligence: Enabling and Supporting RAS Technologies\" Proceedings","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31256/UKRAS19.12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
A deep neural network architecture is proposed in
this paper for underwater scene semantic segmentation. The
architecture consists of encoder and decoder networks. Pretrained VGG-16 network is used as a feature extractor, while the
decoder learns to expand the lower resolution feature maps. The
network applies max un-pooling operator to avoid large number
of learnable parameters, and, in order to make use of the feature
maps in encoder network, it concatenates the feature maps with
decoder and encoder for lower resolution feature maps. Our
architecture shows capabilities of faster convergence and better
accuracy. To get a clear view of underwater scene, an underwater
enhancement neural network architecture is described in this
paper and applied for training. It speeds up the training process
and convergence rate in training.