Research on building extraction method based on surveillance images

Y. Xie, Jianhua Huang, Xiyan Sun, W. Yin, Zhenghan Qiao, Yao Zhang
{"title":"Research on building extraction method based on surveillance images","authors":"Y. Xie, Jianhua Huang, Xiyan Sun, W. Yin, Zhenghan Qiao, Yao Zhang","doi":"10.1109/ICISCAE52414.2021.9590640","DOIUrl":null,"url":null,"abstract":"To solve the problems such as large workload, easy omission, low timeliness and low degree of automation in the method of visual identification of buildings in surveillance images, this paper studies the building extraction method based on surveillance images. In this paper, first of all, datasets of relevant scenes are collected and annotated. Then, we fine-tuned the Deeplabv3plus model to improve the accuracy of building extraction. Specifically, replace the backbone network with the resnet, the dilation rate is reduced to improve the detection accuracy of small objects, the output of the res net is combined with the output of the ASPP module through the way of skip connection, and the spatial details of the lower level and the semantic information of the higher level are fused. Besides, the multiple loss strategy is adopted. we also compared the fine-tuned model combined with different deep-level feature extraction networks with other classical semantic segmentation models on the open source CAMVID dataset, and the experiment showed that the combination of fine-tuned deeplabv3plus model and resnet50 reached the optimal IoU, F1 score and precision. In addition, we conducted an experimental comparison between the two training methods of only using the collected data training and the joint training of CAMVID dataset. The experiment shows that the model segmentation effect obtained by the joint training of data set is better. It significantly improves the details of the edge of the building, which can achieve robust extraction of the building.","PeriodicalId":121049,"journal":{"name":"2021 IEEE 4th International Conference on Information Systems and Computer Aided Education (ICISCAE)","volume":"9 1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 4th International Conference on Information Systems and Computer Aided Education (ICISCAE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICISCAE52414.2021.9590640","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

To solve the problems such as large workload, easy omission, low timeliness and low degree of automation in the method of visual identification of buildings in surveillance images, this paper studies the building extraction method based on surveillance images. In this paper, first of all, datasets of relevant scenes are collected and annotated. Then, we fine-tuned the Deeplabv3plus model to improve the accuracy of building extraction. Specifically, replace the backbone network with the resnet, the dilation rate is reduced to improve the detection accuracy of small objects, the output of the res net is combined with the output of the ASPP module through the way of skip connection, and the spatial details of the lower level and the semantic information of the higher level are fused. Besides, the multiple loss strategy is adopted. we also compared the fine-tuned model combined with different deep-level feature extraction networks with other classical semantic segmentation models on the open source CAMVID dataset, and the experiment showed that the combination of fine-tuned deeplabv3plus model and resnet50 reached the optimal IoU, F1 score and precision. In addition, we conducted an experimental comparison between the two training methods of only using the collected data training and the joint training of CAMVID dataset. The experiment shows that the model segmentation effect obtained by the joint training of data set is better. It significantly improves the details of the edge of the building, which can achieve robust extraction of the building.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于监控图像的建筑物提取方法研究
针对监控图像中建筑物视觉识别方法工作量大、易遗漏、时效性低、自动化程度低等问题,本文研究了基于监控图像的建筑物提取方法。本文首先对相关场景的数据集进行收集和标注。然后,我们对Deeplabv3plus模型进行微调,以提高建筑物提取的精度。具体来说,用resnet代替骨干网,降低膨胀率以提高小目标的检测精度,通过跳接的方式将resnet的输出与ASPP模块的输出相结合,融合下层的空间细节和上层的语义信息。采用多重亏损策略。在开源CAMVID数据集上,我们还将结合不同深度特征提取网络的微调模型与其他经典语义分割模型进行了比较,实验表明,微调后的deeplabv3plus模型与resnet50的组合达到了最优的IoU、F1分数和精度。此外,我们还对仅使用采集数据训练和CAMVID数据集联合训练两种训练方法进行了实验比较。实验表明,通过对数据集进行联合训练得到的模型分割效果较好。它显著改善了建筑物边缘的细节,可以实现对建筑物的鲁棒提取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Transient stability control system based on mapping cloud and online pre decision Tracking Control of Neural System using Adaptive Sliding Mode Control for Unknown Nonlinear Function Deep Self-Supervised Learning for Oracle Bone Inscriptions Features Representation Experimental Teaching Platform Development for Topological Sorting Algorithm Education Virtual Service Failure Recovery Algorithm Based on Particle Swarm in IPv6 Networks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1