Bounding Box Propagation for Semi-automatic Video Annotation of Nighttime Driving Scenes
Dominik Schörkhuber, Florian Groh, M. Gelautz
2021 12th International Symposium on Image and Signal Processing and Analysis (ISPA), published 2021-09-13
DOI: 10.1109/ISPA52656.2021.9552141 (https://doi.org/10.1109/ISPA52656.2021.9552141)
Citations: 1
Abstract
Ground-truth annotations are a fundamental requirement for the development of computer vision and deep learning algorithms targeting autonomous driving. Most public datasets have been recorded in urban settings, while countryside roads and nighttime driving conditions are underrepresented. In this paper, we present a semi-automatic approach for bounding box annotation which was developed in the context of nighttime driving videos. In our three-step approach, we (a) generate trajectory proposals through a tracking-by-detection method, (b) extend and verify object trajectories through single object tracking, and (c) propose a pipeline for efficient semi-automatic annotation of object bounding boxes in videos. We evaluate our approach on the CVL dataset, which focuses on nighttime driving conditions on European countryside roads. We demonstrate the improvements achieved by each processing step, and observe an increase of 23% in recall while precision remains almost constant when compared to the initial tracking-by-detection approach.
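To make step (a) more concrete, the following is a minimal sketch of how tracking-by-detection can turn per-frame detections into trajectory proposals. It is not the authors' implementation: the greedy IoU-based linking, the function names `iou` and `link_detections`, and the threshold of 0.5 are all assumptions chosen for illustration, and the abstract does not state which association method the paper uses.

```python
from typing import Dict, List, Tuple

# A box is (x1, y1, x2, y2) in pixel coordinates (assumed convention).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def link_detections(
    frames: List[List[Box]], iou_threshold: float = 0.5
) -> List[Dict[int, Box]]:
    """Greedily link detections in consecutive frames into trajectories.

    Each trajectory maps a frame index to the box assigned in that frame.
    A trajectory ends as soon as no detection in the next frame overlaps
    its last box by at least `iou_threshold` (hypothetical default).
    """
    trajectories: List[Dict[int, Box]] = []
    active: List[int] = []  # trajectories that received a box in the previous frame

    for t, boxes in enumerate(frames):
        unmatched = list(range(len(boxes)))
        next_active: List[int] = []

        for tid in active:
            last_box = trajectories[tid][t - 1]
            best_j, best_score = None, iou_threshold
            for j in unmatched:
                score = iou(last_box, boxes[j])
                if score >= best_score:
                    best_j, best_score = j, score
            if best_j is not None:
                trajectories[tid][t] = boxes[best_j]
                unmatched.remove(best_j)
                next_active.append(tid)

        # Every detection left over starts a new trajectory proposal.
        for j in unmatched:
            trajectories.append({t: boxes[j]})
            next_active.append(len(trajectories) - 1)

        active = next_active

    return trajectories


if __name__ == "__main__":
    detections = [
        [(0, 0, 10, 10)],    # frame 0
        [(1, 0, 11, 10)],    # frame 1: same object, shifted slightly
        [(50, 50, 60, 60)],  # frame 2: a different object
    ]
    for trajectory in link_detections(detections):
        print(sorted(trajectory.keys()))
```

In a sketch like this, a trajectory breaks whenever the detector misses the object in a frame. Gaps of that kind are what step (b) addresses, by extending and verifying trajectories with a single object tracker.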