LabelAR

Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology Pub Date : 2019-10-17 DOI:10.1145/3332165.3347927

Michael J. Laielli, James Smith, Giscard Biamby, Trevor Darrell, B. Hartmann

引用次数: 13

Abstract

Computer vision is applied in an ever expanding range of applications, many of which require custom training data to perform well. We present a novel interface for rapid collection of labeled training images to improve CV-based object detectors. LabelAR leverages the spatial tracking capabilities of an AR-enabled camera, allowing users to place persistent bounding volumes that stay centered on real-world objects. The interface then guides the user to move the camera to cover a wide variety of viewpoints. We eliminate the need for post hoc labeling of images by automatically projecting 2D bounding boxes around objects in the images as they are captured from AR-marked viewpoints. In a user study with 12 participants, LabelAR significantly outperforms existing approaches in terms of the trade-off between detection performance and collection time.

查看原文