{"title":"改进的条件GAN航空图像分割","authors":"M. Dimoiu, D. Popescu, L. Ichim","doi":"10.1109/africon51333.2021.9570942","DOIUrl":null,"url":null,"abstract":"Generative Adversarial Network (GAN) is an algorithmic architecture containing two neural networks, placed against each other to generate new synthetic images and it has been used successfully in image segmentation. The paper analyzes different GAN implementations for segmentation of images acquired by aerial robots in a real context of a rural zone in Romania. To improve the segmentation performance, a new GAN network is proposed by adding a new layer. Data augmentation was done by the following techniques: mirroring, rotation, scaling, gray scaling, blurring, sharpening, etc. Five classes of region of interest are considered: floods, vegetations, buildings, roads, and dry land. GAN implementations were tested on CPU, GPU, and TPU, on individual computing devices and in the cloud. A new layer was added. The performances were analyzed in terms of learning time, operating time, and statistical indicators. The batch size was generally low: batches of 1, 4 or 16 images were used in this paper. The results confirm that the use of batch achieves the best training and generalization performance in terms of computational cost, for a wide range of experiments.","PeriodicalId":170342,"journal":{"name":"2021 IEEE AFRICON","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Improved Conditional GAN for Aerial Image Segmentation\",\"authors\":\"M. Dimoiu, D. Popescu, L. Ichim\",\"doi\":\"10.1109/africon51333.2021.9570942\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Generative Adversarial Network (GAN) is an algorithmic architecture containing two neural networks, placed against each other to generate new synthetic images and it has been used successfully in image segmentation. The paper analyzes different GAN implementations for segmentation of images acquired by aerial robots in a real context of a rural zone in Romania. To improve the segmentation performance, a new GAN network is proposed by adding a new layer. Data augmentation was done by the following techniques: mirroring, rotation, scaling, gray scaling, blurring, sharpening, etc. Five classes of region of interest are considered: floods, vegetations, buildings, roads, and dry land. GAN implementations were tested on CPU, GPU, and TPU, on individual computing devices and in the cloud. A new layer was added. The performances were analyzed in terms of learning time, operating time, and statistical indicators. The batch size was generally low: batches of 1, 4 or 16 images were used in this paper. The results confirm that the use of batch achieves the best training and generalization performance in terms of computational cost, for a wide range of experiments.\",\"PeriodicalId\":170342,\"journal\":{\"name\":\"2021 IEEE AFRICON\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE AFRICON\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/africon51333.2021.9570942\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE AFRICON","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/africon51333.2021.9570942","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improved Conditional GAN for Aerial Image Segmentation
Generative Adversarial Network (GAN) is an algorithmic architecture containing two neural networks, placed against each other to generate new synthetic images and it has been used successfully in image segmentation. The paper analyzes different GAN implementations for segmentation of images acquired by aerial robots in a real context of a rural zone in Romania. To improve the segmentation performance, a new GAN network is proposed by adding a new layer. Data augmentation was done by the following techniques: mirroring, rotation, scaling, gray scaling, blurring, sharpening, etc. Five classes of region of interest are considered: floods, vegetations, buildings, roads, and dry land. GAN implementations were tested on CPU, GPU, and TPU, on individual computing devices and in the cloud. A new layer was added. The performances were analyzed in terms of learning time, operating time, and statistical indicators. The batch size was generally low: batches of 1, 4 or 16 images were used in this paper. The results confirm that the use of batch achieves the best training and generalization performance in terms of computational cost, for a wide range of experiments.