A Comparison of Convolutional Neural Network Architectures for Automated Detection and Identification of Waterfowl in Complex Environments

Q3 Social Sciences GI_Forum Pub Date : 2021-01-01 DOI:10.1553/giscience2021_02_s152

Mohammad Mustafa Sa'doun, C. Lippitt, Gernot Paulus, Karl-Heinrich Anders

{"title":"A Comparison of Convolutional Neural Network Architectures for Automated Detection and Identification of Waterfowl in Complex Environments","authors":"Mohammad Mustafa Sa'doun, C. Lippitt, Gernot Paulus, Karl-Heinrich Anders","doi":"10.1553/giscience2021_02_s152","DOIUrl":null,"url":null,"abstract":"Waterfowl monitoring is an important task for understanding waterfowl distribution and habitats. Surveying approaches using hyper-spatial airborne imagery, collected by small unoccupied aerial systems (sUAS), hold potential to overcome the limitations of traditional methods while improving count efficiency and reliability. Difficulties obtaining waterfowl counts, particularly in complex image scenes, from the high quantity of imagery required hinders deployment of large-scale surveys. In this paper, we test Convolutional Neural Networks (CNNs) to understand their potential and how they behave across different versions of our waterfowl dataset. Three CNN architectures (YOLO, Retinanet and Faster RCNN) were trained on 3 hierarchical levels: waterfowl detection (True / False), waterfowl type (3 classes), and waterfowl species (8 classes). The architectures generally performed well, and results indicate that automated waterfowl detection in complex environments, and therefore enumeration, is feasible using current technology. Waterfowl identification in complex environments was not successful using the available training data, but we propose steps that might enhance the results.","PeriodicalId":29645,"journal":{"name":"GI_Forum","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"GI_Forum","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1553/giscience2021_02_s152","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Social Sciences","Score":null,"Total":0}

引用次数: 1

Abstract

Waterfowl monitoring is an important task for understanding waterfowl distribution and habitats. Surveying approaches using hyper-spatial airborne imagery, collected by small unoccupied aerial systems (sUAS), hold potential to overcome the limitations of traditional methods while improving count efficiency and reliability. Difficulties obtaining waterfowl counts, particularly in complex image scenes, from the high quantity of imagery required hinders deployment of large-scale surveys. In this paper, we test Convolutional Neural Networks (CNNs) to understand their potential and how they behave across different versions of our waterfowl dataset. Three CNN architectures (YOLO, Retinanet and Faster RCNN) were trained on 3 hierarchical levels: waterfowl detection (True / False), waterfowl type (3 classes), and waterfowl species (8 classes). The architectures generally performed well, and results indicate that automated waterfowl detection in complex environments, and therefore enumeration, is feasible using current technology. Waterfowl identification in complex environments was not successful using the available training data, but we propose steps that might enhance the results.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

复杂环境中水禽自动检测与识别的卷积神经网络体系结构比较

水禽监测是了解水禽分布和生境的重要任务。利用小型无人空中系统(sUAS)收集的超空间航空图像进行测量，有望克服传统方法的局限性，同时提高计数效率和可靠性。从所需的大量图像中获取水禽数量的困难，特别是在复杂的图像场景中，阻碍了大规模调查的部署。在本文中，我们测试了卷积神经网络(cnn)，以了解它们的潜力以及它们在不同版本的水禽数据集上的表现。在3个层次上训练了3种CNN架构(YOLO、Retinanet和Faster RCNN):水禽检测(真/假)、水禽类型(3类)和水禽种类(8类)。这些体系结构总体上表现良好，结果表明在复杂环境中自动水禽检测以及枚举在现有技术下是可行的。使用现有的训练数据在复杂环境中识别水禽是不成功的，但我们提出了可能增强结果的步骤。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊