Yang Zhou, Jiangtao Wang, Baihua Li, Q. Meng, Emanuele Rocco, Andrea Saiani
{"title":"基于深度神经网络的水下场景分割","authors":"Yang Zhou, Jiangtao Wang, Baihua Li, Q. Meng, Emanuele Rocco, Andrea Saiani","doi":"10.31256/UKRAS19.12","DOIUrl":null,"url":null,"abstract":"A deep neural network architecture is proposed in\nthis paper for underwater scene semantic segmentation. The\narchitecture consists of encoder and decoder networks. Pretrained VGG-16 network is used as a feature extractor, while the\ndecoder learns to expand the lower resolution feature maps. The\nnetwork applies max un-pooling operator to avoid large number\nof learnable parameters, and, in order to make use of the feature\nmaps in encoder network, it concatenates the feature maps with\ndecoder and encoder for lower resolution feature maps. Our\narchitecture shows capabilities of faster convergence and better\naccuracy. To get a clear view of underwater scene, an underwater\nenhancement neural network architecture is described in this\npaper and applied for training. It speeds up the training process\nand convergence rate in training.","PeriodicalId":424229,"journal":{"name":"UK-RAS19 Conference: \"Embedded Intelligence: Enabling and Supporting RAS Technologies\" Proceedings","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Underwater Scene Segmentation by Deep Neural Network\",\"authors\":\"Yang Zhou, Jiangtao Wang, Baihua Li, Q. Meng, Emanuele Rocco, Andrea Saiani\",\"doi\":\"10.31256/UKRAS19.12\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A deep neural network architecture is proposed in\\nthis paper for underwater scene semantic segmentation. The\\narchitecture consists of encoder and decoder networks. Pretrained VGG-16 network is used as a feature extractor, while the\\ndecoder learns to expand the lower resolution feature maps. The\\nnetwork applies max un-pooling operator to avoid large number\\nof learnable parameters, and, in order to make use of the feature\\nmaps in encoder network, it concatenates the feature maps with\\ndecoder and encoder for lower resolution feature maps. Our\\narchitecture shows capabilities of faster convergence and better\\naccuracy. To get a clear view of underwater scene, an underwater\\nenhancement neural network architecture is described in this\\npaper and applied for training. It speeds up the training process\\nand convergence rate in training.\",\"PeriodicalId\":424229,\"journal\":{\"name\":\"UK-RAS19 Conference: \\\"Embedded Intelligence: Enabling and Supporting RAS Technologies\\\" Proceedings\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"UK-RAS19 Conference: \\\"Embedded Intelligence: Enabling and Supporting RAS Technologies\\\" Proceedings\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.31256/UKRAS19.12\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"UK-RAS19 Conference: \"Embedded Intelligence: Enabling and Supporting RAS Technologies\" Proceedings","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31256/UKRAS19.12","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Underwater Scene Segmentation by Deep Neural Network
A deep neural network architecture is proposed in
this paper for underwater scene semantic segmentation. The
architecture consists of encoder and decoder networks. Pretrained VGG-16 network is used as a feature extractor, while the
decoder learns to expand the lower resolution feature maps. The
network applies max un-pooling operator to avoid large number
of learnable parameters, and, in order to make use of the feature
maps in encoder network, it concatenates the feature maps with
decoder and encoder for lower resolution feature maps. Our
architecture shows capabilities of faster convergence and better
accuracy. To get a clear view of underwater scene, an underwater
enhancement neural network architecture is described in this
paper and applied for training. It speeds up the training process
and convergence rate in training.