S. Chevtchenko, Rafaella F. Vale, F. Cordeiro, V. Macário
{"title":"基于深度学习的海滩图像人物检测","authors":"S. Chevtchenko, Rafaella F. Vale, F. Cordeiro, V. Macário","doi":"10.1109/BRACIS.2018.00045","DOIUrl":null,"url":null,"abstract":"Convolutional architectures have in recent years become state-of-the-art for several object detection tasks. However, these detectors have not yet been evaluated for detection and monitoring of beach areas. As some of these areas need to be continually monitored for dangerous situations, such as shark attacks, an automated system would be an effective risk control measure. The most significant and specific challenges for this problem are variable scene illumination, partial occlusion and distant camera position. In this work we present a study on three recent convolutional architectures for the task of people detection in beach scenarios. Our dataset is composed of images taken in the Boa Viagem beach, in Brazil, and is used to evaluate Faster R-CNN, R-FCN and SSD in terms of quality and speed of detection. The detectors are pretrained on a dataset containing 91 classes of objects, including people with different levels of scale and occlusion. The results suggest that the Faster R-CNN meta-architecture with the Resnet 101 feature extractor generates significantly better detections in terms of F-measure, while performing at 5.6 fps on a GTX 1080 Ti GPU.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Deep Learning for People Detection on Beach Images\",\"authors\":\"S. Chevtchenko, Rafaella F. Vale, F. Cordeiro, V. Macário\",\"doi\":\"10.1109/BRACIS.2018.00045\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Convolutional architectures have in recent years become state-of-the-art for several object detection tasks. However, these detectors have not yet been evaluated for detection and monitoring of beach areas. As some of these areas need to be continually monitored for dangerous situations, such as shark attacks, an automated system would be an effective risk control measure. The most significant and specific challenges for this problem are variable scene illumination, partial occlusion and distant camera position. In this work we present a study on three recent convolutional architectures for the task of people detection in beach scenarios. Our dataset is composed of images taken in the Boa Viagem beach, in Brazil, and is used to evaluate Faster R-CNN, R-FCN and SSD in terms of quality and speed of detection. The detectors are pretrained on a dataset containing 91 classes of objects, including people with different levels of scale and occlusion. The results suggest that the Faster R-CNN meta-architecture with the Resnet 101 feature extractor generates significantly better detections in terms of F-measure, while performing at 5.6 fps on a GTX 1080 Ti GPU.\",\"PeriodicalId\":405190,\"journal\":{\"name\":\"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)\",\"volume\":\"27 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BRACIS.2018.00045\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BRACIS.2018.00045","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
摘要
卷积架构近年来已经成为一些目标检测任务的最先进的技术。然而,这些探测器在探测和监测海滩地区方面尚未得到评价。由于其中一些区域需要持续监测危险情况,例如鲨鱼袭击,自动化系统将是一种有效的风险控制措施。这个问题最重要和最具体的挑战是可变的场景照明,部分遮挡和远距离摄像机位置。在这项工作中,我们对海滩场景中人员检测任务的三种最新卷积架构进行了研究。我们的数据集由在巴西Boa Viagem海滩拍摄的图像组成,并用于评估Faster R-CNN, R-FCN和SSD在检测质量和速度方面的性能。检测器在包含91类对象的数据集上进行预训练,包括具有不同规模和遮挡水平的人。结果表明,更快的R-CNN元架构与Resnet 101特征提取器在F-measure方面产生了显着更好的检测,而在GTX 1080 Ti GPU上以5.6 fps的速度执行。
Deep Learning for People Detection on Beach Images
Convolutional architectures have in recent years become state-of-the-art for several object detection tasks. However, these detectors have not yet been evaluated for detection and monitoring of beach areas. As some of these areas need to be continually monitored for dangerous situations, such as shark attacks, an automated system would be an effective risk control measure. The most significant and specific challenges for this problem are variable scene illumination, partial occlusion and distant camera position. In this work we present a study on three recent convolutional architectures for the task of people detection in beach scenarios. Our dataset is composed of images taken in the Boa Viagem beach, in Brazil, and is used to evaluate Faster R-CNN, R-FCN and SSD in terms of quality and speed of detection. The detectors are pretrained on a dataset containing 91 classes of objects, including people with different levels of scale and occlusion. The results suggest that the Faster R-CNN meta-architecture with the Resnet 101 feature extractor generates significantly better detections in terms of F-measure, while performing at 5.6 fps on a GTX 1080 Ti GPU.