Tohme: detecting curb ramps in Google Street View using crowdsourcing, computer vision, and machine learning

Proceedings of the 27th Annual ACM Symposium on User Interface Software and Technology. Pub Date: 2014-10-05. DOI: 10.1145/2642918.2647403

Kotaro Hara, J. Sun, Robert Moore, D. Jacobs, Jon E. Froehlich


Citations: 101



Tohme: detecting curb ramps in Google Street View using crowdsourcing, computer vision, and machine learning

Building on recent prior work that combines Google Street View (GSV) and crowdsourcing to remotely collect information on physical-world accessibility, we present the first 'smart' system, Tohme, that combines machine learning, computer vision (CV), and custom crowd interfaces to find curb ramps remotely in GSV scenes. Tohme consists of two workflows, a human labeling pipeline and a CV pipeline with human verification, which are scheduled dynamically based on predicted performance. Using 1,086 GSV scenes (street intersections) from four North American cities and data from 403 crowd workers, we show that Tohme performs similarly to a manual labeling approach alone in detecting curb ramps (F-measure: 84% vs. 86% baseline) but at a 13% reduction in time cost. Our work contributes the first CV-based curb ramp detection system, a custom machine-learning-based workflow controller, a validation of GSV as a viable curb ramp data source, and a detailed examination of why curb ramp detection is a hard problem, along with steps forward.
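The dynamic scheduling and the F-measure comparison described in the abstract can be illustrated with a minimal sketch. Everything named here is an assumption for illustration only: the `Scene` type, the `predicted_cv_score` field, the 0.5 threshold, and the workflow labels are hypothetical and are not taken from the paper, and the balanced (F1) form of the F-measure is assumed.

```python
from dataclasses import dataclass


@dataclass
class Scene:
    scene_id: str
    # Hypothetical: predicted quality of the CV output for this scene, in [0, 1].
    predicted_cv_score: float


def route_scene(scene: Scene, threshold: float = 0.5) -> str:
    """Pick a workflow for one scene (the threshold is an illustrative assumption)."""
    if scene.predicted_cv_score >= threshold:
        # CV is expected to do well: run detection, then have humans verify.
        return "cv_plus_verification"
    # CV is expected to struggle: send the scene to manual labeling.
    return "manual_labeling"


def f_measure(true_pos: int, false_pos: int, false_neg: int) -> float:
    """Balanced F-measure: harmonic mean of precision and recall."""
    if true_pos == 0:
        return 0.0
    precision = true_pos / (true_pos + false_pos)
    recall = true_pos / (true_pos + false_neg)
    return 2 * precision * recall / (precision + recall)


# Route a small batch of hypothetical scenes.
scenes = [Scene("a", 0.9), Scene("b", 0.2)]
assignments = {s.scene_id: route_scene(s) for s in scenes}
```

In this sketch the routing decision depends only on a single predicted score; the controller described in the abstract is machine-learning based, so a real predictor would likely draw on richer features of each scene.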
