{"title":"基于注意机制的改进轻量级DeepLabv3+算法","authors":"Lin Wu, J. Xiao, Zhe Zhang","doi":"10.1109/icaci55529.2022.9837577","DOIUrl":null,"url":null,"abstract":"DeepLabv3+ has a wide range of applications in autonomous driving, geographic information systems, etc. However, its deployment on the mobile terminal faces a trade-off between model size and accuracy. Consecutive downsampling operations also result in a great loss of detail information. To solve these problems, this paper proposes an improved algorithm based on DeepLabv3+. Firstly, backbone is replaced by MobileNetv2 to reduce the size of the model; Secondly, the improved Atrous Spatial Pyramid Pooling module is proposed to augment the segmentation result while reducing the parameters. The performance is further ameliorated by applying attention mechanism; Finally, through refining decoder module, the proposed network makes up for lost detail information. Experiment shows that the algorithm achieves an mIoU of 73.31% on the validation set of the PASCAL VOC2012 dataset. Compared with typical algorithms, proposed algorithm has a better effect on trade-off between model size and accuracy.","PeriodicalId":412347,"journal":{"name":"2022 14th International Conference on Advanced Computational Intelligence (ICACI)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Improved Lightweight DeepLabv3+ Algorithm Based on Attention Mechanism\",\"authors\":\"Lin Wu, J. Xiao, Zhe Zhang\",\"doi\":\"10.1109/icaci55529.2022.9837577\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"DeepLabv3+ has a wide range of applications in autonomous driving, geographic information systems, etc. However, its deployment on the mobile terminal faces a trade-off between model size and accuracy. Consecutive downsampling operations also result in a great loss of detail information. To solve these problems, this paper proposes an improved algorithm based on DeepLabv3+. Firstly, backbone is replaced by MobileNetv2 to reduce the size of the model; Secondly, the improved Atrous Spatial Pyramid Pooling module is proposed to augment the segmentation result while reducing the parameters. The performance is further ameliorated by applying attention mechanism; Finally, through refining decoder module, the proposed network makes up for lost detail information. Experiment shows that the algorithm achieves an mIoU of 73.31% on the validation set of the PASCAL VOC2012 dataset. Compared with typical algorithms, proposed algorithm has a better effect on trade-off between model size and accuracy.\",\"PeriodicalId\":412347,\"journal\":{\"name\":\"2022 14th International Conference on Advanced Computational Intelligence (ICACI)\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 14th International Conference on Advanced Computational Intelligence (ICACI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/icaci55529.2022.9837577\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 14th International Conference on Advanced Computational Intelligence (ICACI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icaci55529.2022.9837577","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improved Lightweight DeepLabv3+ Algorithm Based on Attention Mechanism
DeepLabv3+ has a wide range of applications in autonomous driving, geographic information systems, etc. However, its deployment on the mobile terminal faces a trade-off between model size and accuracy. Consecutive downsampling operations also result in a great loss of detail information. To solve these problems, this paper proposes an improved algorithm based on DeepLabv3+. Firstly, backbone is replaced by MobileNetv2 to reduce the size of the model; Secondly, the improved Atrous Spatial Pyramid Pooling module is proposed to augment the segmentation result while reducing the parameters. The performance is further ameliorated by applying attention mechanism; Finally, through refining decoder module, the proposed network makes up for lost detail information. Experiment shows that the algorithm achieves an mIoU of 73.31% on the validation set of the PASCAL VOC2012 dataset. Compared with typical algorithms, proposed algorithm has a better effect on trade-off between model size and accuracy.