首页 > 最新文献

GEOCROWD '13最新文献

英文 中文
CrowdRoute: a crowd-sourced routing algorithm in public transit networks CrowdRoute:一种公共交通网络中的众包路由算法
Pub Date : 2013-11-05 DOI: 10.1145/2534732.2534738
To Tu Cuong
Most existing algorithms assume that public transit networks are static. However in reality, a bus may be delayed or canceled, which causes routing algorithms to generate non-optimal journeys. We propose CrowdRoute, an algorithm that exploits real-time information contributed by the crowd, to solve the earliest arrival problem in public transit networks.
大多数现有算法都假定公共交通网络是静态的。然而,在现实中,公共汽车可能会延迟或取消,这导致路由算法产生非最优行程。我们提出了一种利用人群提供的实时信息的算法CrowdRoute来解决公共交通网络中的最早到达问题。
{"title":"CrowdRoute: a crowd-sourced routing algorithm in public transit networks","authors":"To Tu Cuong","doi":"10.1145/2534732.2534738","DOIUrl":"https://doi.org/10.1145/2534732.2534738","url":null,"abstract":"Most existing algorithms assume that public transit networks are static. However in reality, a bus may be delayed or canceled, which causes routing algorithms to generate non-optimal journeys. We propose CrowdRoute, an algorithm that exploits real-time information contributed by the crowd, to solve the earliest arrival problem in public transit networks.","PeriodicalId":314116,"journal":{"name":"GEOCROWD '13","volume":"72 11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130172385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
The one and many maps: participatory and temporal diversities in OpenStreetMap 一张和多张地图:OpenStreetMap中的参与性和时间多样性
Pub Date : 2013-11-05 DOI: 10.1145/2534732.2534737
Tyng-Ruey Chuang, D. Deng, Chun-Chen Hsu, R. Lemmens
OpenStreetMap is an open and collaborative project with thousands of people contributing GPS traces and other data into the making of a global map of places and networks. It is open in the sense that everyone can contribute to the project, and results from the project are free for everyone to reuse. This is contrary to traditional cartography where often a central authority controls the making of the map and its release. Is OpenStreetMap more democratic, and in what sense? Is OpenStreetMap more relevant to the mass, and how can we judge? We define and use several metrics to measure temporal properties of defined areas in OpenStreetMap, and to sample modes of participation in these areas. These metrics are used to graph the datasets representing the current OpenStreetMap so as to reveal unevenness in user participation and data temporality. We use the dataset about Taiwan as a test case to observe participatory and temporal diversities among different areas of Taiwan in OpenStreetMap.
OpenStreetMap是一个开放和协作的项目,成千上万的人贡献GPS轨迹和其他数据来制作全球地点和网络地图。它是开放的,因为每个人都可以为项目做出贡献,并且每个人都可以免费重用项目的结果。这与传统的地图学相反,传统的地图学通常由一个中央机构控制地图的制作和发布。开放地图更民主吗?在什么意义上?开放地图是否更贴近大众,我们如何判断?我们定义并使用几个指标来测量OpenStreetMap中定义区域的时间属性,并对这些区域的参与模式进行采样。这些指标用于绘制代表当前OpenStreetMap的数据集,从而揭示用户参与和数据时间性的不均匀性。我们以台湾地区的数据集作为测试案例,观察台湾不同地区在OpenStreetMap中的参与性和时间多样性。
{"title":"The one and many maps: participatory and temporal diversities in OpenStreetMap","authors":"Tyng-Ruey Chuang, D. Deng, Chun-Chen Hsu, R. Lemmens","doi":"10.1145/2534732.2534737","DOIUrl":"https://doi.org/10.1145/2534732.2534737","url":null,"abstract":"OpenStreetMap is an open and collaborative project with thousands of people contributing GPS traces and other data into the making of a global map of places and networks. It is open in the sense that everyone can contribute to the project, and results from the project are free for everyone to reuse. This is contrary to traditional cartography where often a central authority controls the making of the map and its release. Is OpenStreetMap more democratic, and in what sense? Is OpenStreetMap more relevant to the mass, and how can we judge?\u0000 We define and use several metrics to measure temporal properties of defined areas in OpenStreetMap, and to sample modes of participation in these areas. These metrics are used to graph the datasets representing the current OpenStreetMap so as to reveal unevenness in user participation and data temporality. We use the dataset about Taiwan as a test case to observe participatory and temporal diversities among different areas of Taiwan in OpenStreetMap.","PeriodicalId":314116,"journal":{"name":"GEOCROWD '13","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129525246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Detecting spatio-temporal outliers in crowdsourced bathymetry data 众包测深数据的时空异常值检测
Pub Date : 2013-11-05 DOI: 10.1145/2534732.2534739
Leela Sedaghat, J. Hersey, M. P. McGuire
The widespread availability of Internet access and location-acquisition technologies, such as the global positioning system (GPS), has given rise to the growing phenomenon of Volunteered Geographic Information (VGI). Our work presents the use of VGI in bathymetry and hydrographic surveying and demonstrates that crowdsourced bathymetry data (CSB) can yield valuable knowledge for the maritime community. In this study, CSB data collected from 2012 to 2013 within the Baltimore Inner Harbor was used to locate anomalous depth measurements that could indicate the presence of submerged debris. To this end, we explored two approaches for detecting spatio-temporal outliers in the CSB data. In the first approach, we combined Local Outlier Factor and DBSCAN in an ensemble method to find spatio-temporal clusters of anomalous measurements that could indicate the presence of submerged debris. In the second approach, we calculated a measure of local spatial autocorrelation over time to identify "hotspots" or specific areas that consistently have low depth measurements compared to their immediate neighbors (i.e. "low-high" outliers). Results from both approaches revealed locations within the Fort McHenry Channel whose depth measurements may be indicative of the presence of submerged marine debris and, as such, may pose a threat to the safety of mariners operating in that region. Our results indicate that CSB data can not only help to improve the safety of mariners, but also serve to alert authorities in a timely manner that channel maintenance, a re-survey, and/or changes to the nautical chart may be needed.
互联网接入和位置获取技术的广泛使用,如全球定位系统(GPS),引起了日益增长的志愿地理信息(VGI)现象。我们的工作展示了VGI在测深和水文测量中的应用,并证明了众包测深数据(CSB)可以为海事界提供有价值的知识。在这项研究中,从2012年到2013年在巴尔的摩内港收集的CSB数据被用于定位异常深度测量,这可能表明水下碎片的存在。为此,我们探索了两种检测CSB数据时空异常值的方法。在第一种方法中,我们将局部离群因子(Local Outlier Factor)和DBSCAN结合在一起,以一种集合方法找到可能表明存在淹没碎片的异常测量的时空集群。在第二种方法中,我们计算了局部空间自相关随时间的度量,以识别“热点”或与其近邻(即海底)相比始终具有低深度测量的特定区域。“低”异常值)。两种方法的结果都揭示了麦克亨利堡海峡内的一些地点,其深度测量可能表明水下海洋碎片的存在,因此可能对在该区域作业的海员的安全构成威胁。我们的研究结果表明,CSB数据不仅可以帮助提高船员的安全,还可以及时提醒当局可能需要进行航道维修、重新测量和/或更改海图。
{"title":"Detecting spatio-temporal outliers in crowdsourced bathymetry data","authors":"Leela Sedaghat, J. Hersey, M. P. McGuire","doi":"10.1145/2534732.2534739","DOIUrl":"https://doi.org/10.1145/2534732.2534739","url":null,"abstract":"The widespread availability of Internet access and location-acquisition technologies, such as the global positioning system (GPS), has given rise to the growing phenomenon of Volunteered Geographic Information (VGI). Our work presents the use of VGI in bathymetry and hydrographic surveying and demonstrates that crowdsourced bathymetry data (CSB) can yield valuable knowledge for the maritime community. In this study, CSB data collected from 2012 to 2013 within the Baltimore Inner Harbor was used to locate anomalous depth measurements that could indicate the presence of submerged debris. To this end, we explored two approaches for detecting spatio-temporal outliers in the CSB data. In the first approach, we combined Local Outlier Factor and DBSCAN in an ensemble method to find spatio-temporal clusters of anomalous measurements that could indicate the presence of submerged debris. In the second approach, we calculated a measure of local spatial autocorrelation over time to identify \"hotspots\" or specific areas that consistently have low depth measurements compared to their immediate neighbors (i.e. \"low-high\" outliers). Results from both approaches revealed locations within the Fort McHenry Channel whose depth measurements may be indicative of the presence of submerged marine debris and, as such, may pose a threat to the safety of mariners operating in that region. Our results indicate that CSB data can not only help to improve the safety of mariners, but also serve to alert authorities in a timely manner that channel maintenance, a re-survey, and/or changes to the nautical chart may be needed.","PeriodicalId":314116,"journal":{"name":"GEOCROWD '13","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130697060","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Web-based enrichment of bike sensor data for automatic geo-annotation 基于web的自行车传感器数据的自动地理标注
Pub Date : 2013-11-05 DOI: 10.1145/2534732.2534744
S. Verstockt, Viktor Slavkovikj, Olivier Janssens, P. D. Potter, Jürgen Slowack, R. Walle
In this paper, we describe a multi-modal bike sensing setup for automatic geo-annotation of terrain types using web-based data enrichment. The proposed road/terrain classification system is mainly based on the analysis of volunteered geographic information gathered by bikers. By using participatory accelerometer and GPS sensor data collected from cyclists' smartphones, which is enriched with data from geographic web services, the proposed system is able to distinguish between 6 different terrain types. For the classification of the web-based enriched sensor data, the system employs a random decision forest (RDF), which compared favorably for the geo-annotation task against different classification algorithms. The system classifies every instance of road (over a 5 seconds interval) and maps the results onto the user collected GPS coordinates. Finally, based on all the collected instances, we can annotate geographic maps with the terrain types and create more advanced route statistics. The accuracy of the bike sensing system is 92% for 6-class terrain classification and 97% for 2-class on-road/off-road classification.
在本文中,我们描述了一个多模式的自行车传感装置,用于基于web的数据丰富的地形类型的自动地理标注。所提出的道路/地形分类系统主要基于对骑自行车者自愿收集的地理信息的分析。通过使用从骑行者的智能手机中收集的参与式加速度计和GPS传感器数据,并结合地理网络服务的数据,该系统能够区分6种不同的地形类型。对于基于web的丰富传感器数据的分类,该系统采用了随机决策森林(RDF),该算法相对于不同的分类算法更适合地理标注任务。该系统对每个道路实例进行分类(间隔超过5秒),并将结果映射到用户收集的GPS坐标上。最后,基于所有收集到的实例,我们可以用地形类型标注地理地图,并创建更高级的路线统计。对于6级地形分类,自行车传感系统的准确率为92%,对于2级道路/越野分类,该系统的准确率为97%。
{"title":"Web-based enrichment of bike sensor data for automatic geo-annotation","authors":"S. Verstockt, Viktor Slavkovikj, Olivier Janssens, P. D. Potter, Jürgen Slowack, R. Walle","doi":"10.1145/2534732.2534744","DOIUrl":"https://doi.org/10.1145/2534732.2534744","url":null,"abstract":"In this paper, we describe a multi-modal bike sensing setup for automatic geo-annotation of terrain types using web-based data enrichment. The proposed road/terrain classification system is mainly based on the analysis of volunteered geographic information gathered by bikers. By using participatory accelerometer and GPS sensor data collected from cyclists' smartphones, which is enriched with data from geographic web services, the proposed system is able to distinguish between 6 different terrain types. For the classification of the web-based enriched sensor data, the system employs a random decision forest (RDF), which compared favorably for the geo-annotation task against different classification algorithms. The system classifies every instance of road (over a 5 seconds interval) and maps the results onto the user collected GPS coordinates. Finally, based on all the collected instances, we can annotate geographic maps with the terrain types and create more advanced route statistics. The accuracy of the bike sensing system is 92% for 6-class terrain classification and 97% for 2-class on-road/off-road classification.","PeriodicalId":314116,"journal":{"name":"GEOCROWD '13","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125366054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Tracing the German centennial flood in the stream of tweets: first lessons learned 在推特流中追踪德国百年洪水:第一课
Pub Date : 2013-11-05 DOI: 10.1145/2534732.2534741
G. Fuchs, N. Andrienko, G. Andrienko, Sebastian Bothe, Hendrik Stange
Social microblogging services such as Twitter result in massive streams of georeferenced messages and geolocated status updates. This real-time source of information is invaluable for many application areas, in particular for disaster detection and response scenarios. Consequently, a considerable number of works has dealt with issues of their acquisition, analysis and visualization. Most of these works not only assume an appropriate percentage of georeferenced messages that allows for detecting relevant events for a specific region and time frame, but also that these geolocations are reasonably correct in representing places and times of the underlying spatio-temporal situation. In this paper, we review these two key assumption based on the results of applying a visual analytics approach to a dataset of georeferenced Tweets from Germany over eight months witnessing several large-scale flooding situations throughout the country. Our results confirm the potential of Twitter as a distributed 'social sensor' but at the same time highlight some caveats in interpreting immediate results. To overcome these limits we explore incorporating evidence from other data sources including further social media and mobile phone network metrics to detect, confirm and refine events with respect to location and time. We summarize the lessons learned from our initial analysis by proposing recommendations and outline possible future work directions.
像Twitter这样的社交微博服务产生了大量的地理信息流和地理位置的状态更新。这种实时信息源对于许多应用领域都是无价的,特别是对于灾难检测和响应场景。因此,相当多的作品处理了它们的获取、分析和可视化问题。这些作品中的大多数不仅假设了适当比例的地理参考信息,允许检测特定区域和时间框架的相关事件,而且这些地理位置在表示潜在时空情况的地点和时间方面是合理正确的。在本文中,我们基于对来自德国的地理参考推文数据集应用可视化分析方法的结果,回顾了这两个关键假设,这些数据集在八个月内见证了全国各地的几次大规模洪水情况。我们的研究结果证实了Twitter作为分布式“社会传感器”的潜力,但同时也强调了在解释即时结果时需要注意的一些问题。为了克服这些限制,我们探索结合其他数据源的证据,包括进一步的社交媒体和移动电话网络指标,以检测、确认和完善有关地点和时间的事件。我们通过提出建议和概述未来可能的工作方向,总结初步分析的经验教训。
{"title":"Tracing the German centennial flood in the stream of tweets: first lessons learned","authors":"G. Fuchs, N. Andrienko, G. Andrienko, Sebastian Bothe, Hendrik Stange","doi":"10.1145/2534732.2534741","DOIUrl":"https://doi.org/10.1145/2534732.2534741","url":null,"abstract":"Social microblogging services such as Twitter result in massive streams of georeferenced messages and geolocated status updates. This real-time source of information is invaluable for many application areas, in particular for disaster detection and response scenarios. Consequently, a considerable number of works has dealt with issues of their acquisition, analysis and visualization. Most of these works not only assume an appropriate percentage of georeferenced messages that allows for detecting relevant events for a specific region and time frame, but also that these geolocations are reasonably correct in representing places and times of the underlying spatio-temporal situation. In this paper, we review these two key assumption based on the results of applying a visual analytics approach to a dataset of georeferenced Tweets from Germany over eight months witnessing several large-scale flooding situations throughout the country. Our results confirm the potential of Twitter as a distributed 'social sensor' but at the same time highlight some caveats in interpreting immediate results. To overcome these limits we explore incorporating evidence from other data sources including further social media and mobile phone network metrics to detect, confirm and refine events with respect to location and time. We summarize the lessons learned from our initial analysis by proposing recommendations and outline possible future work directions.","PeriodicalId":314116,"journal":{"name":"GEOCROWD '13","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126101712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 76
MoveSafe: a framework for transportation mode-based targeted alerting in disaster response MoveSafe:灾害响应中基于运输模式的目标警报框架
Pub Date : 2013-11-05 DOI: 10.1145/2534732.2534735
Paras Mehta, S. Müller, A. Voisard
Disasters, whether natural or man-made, can occur in an unexpected and unanticipated manner causing damage and disruptions. In the event of sudden onset of a hazard, private and public transport users and pedestrians need to be informed and guided to safety. Targeted alerting in early warning systems involves the communication of personalized information to a variety of communities based on their different needs and situations to improve alert usability and compliance. In this paper, we present MoveSafe, a generic and extensible framework for transportation mode-based dynamic partitioning of a population for targeted alerting and for better transport management in hazard occurrence scenarios. We infer the transportation mode of the users dynamically using their location traces through continuous feature extraction and maintenance. In combination with the hazard location, we use the transportation mode information to find clusters of people at potentially different levels of risk and with different information needs. The framework also supports a variety of classification features, classifiers, clustering dimensions, and clustering algorithms. We evaluate its performance in different settings and present the results.
灾害,无论是自然的还是人为的,都可能以意想不到的方式发生,造成破坏和破坏。在突然发生危险的情况下,需要告知私人和公共交通工具的使用者和行人,并引导他们安全。预警系统中的目标警报涉及根据不同需求和情况向各种社区传达个性化信息,以提高警报的可用性和遵从性。在本文中,我们提出了MoveSafe,这是一个通用的、可扩展的框架,用于基于运输模式的人口动态划分,以便在危险发生的情况下进行有针对性的警报和更好的运输管理。通过持续的特征提取和维护,利用用户的位置轨迹动态推断出用户的出行方式。结合危险位置,我们使用交通方式信息来寻找具有不同潜在风险水平和不同信息需求的人群。该框架还支持各种分类特性、分类器、聚类维度和聚类算法。我们评估了它在不同设置下的性能,并给出了结果。
{"title":"MoveSafe: a framework for transportation mode-based targeted alerting in disaster response","authors":"Paras Mehta, S. Müller, A. Voisard","doi":"10.1145/2534732.2534735","DOIUrl":"https://doi.org/10.1145/2534732.2534735","url":null,"abstract":"Disasters, whether natural or man-made, can occur in an unexpected and unanticipated manner causing damage and disruptions. In the event of sudden onset of a hazard, private and public transport users and pedestrians need to be informed and guided to safety. Targeted alerting in early warning systems involves the communication of personalized information to a variety of communities based on their different needs and situations to improve alert usability and compliance. In this paper, we present MoveSafe, a generic and extensible framework for transportation mode-based dynamic partitioning of a population for targeted alerting and for better transport management in hazard occurrence scenarios. We infer the transportation mode of the users dynamically using their location traces through continuous feature extraction and maintenance. In combination with the hazard location, we use the transportation mode information to find clusters of people at potentially different levels of risk and with different information needs. The framework also supports a variety of classification features, classifiers, clustering dimensions, and clustering algorithms. We evaluate its performance in different settings and present the results.","PeriodicalId":314116,"journal":{"name":"GEOCROWD '13","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126844820","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Automatic gazetteer enrichment with user-geocoded data 自动地名词典丰富与用户地理编码的数据
Pub Date : 2013-11-05 DOI: 10.1145/2534732.2534736
J. Gelernter, Gautam Ganesh, Hamsini Krishnakumar, Wei Zhang
Geographical knowledge resources or gazetteers that are enriched with local information have the potential to add geographic precision to information retrieval. We have identified sources of novel local gazetteer entries in crowd-sourced OpenStreetMap and Wikimapia geotags that include geo-coordinates. We created a fuzzy match algorithm using machine learning (SVM) that checks both for approximate spelling and approximate geocoding in order to find duplicates between the crowd-sourced tags and the gazetteer in effort to absorb those tags that are novel. For each crowd-sourced tag, our algorithm generates candidate matches from the gazetteer and then ranks those candidates based on word form or geographical relations between each tag and gazetteer candidate. We compared a baseline of edit distance for candidate ranking to an SVM-trained candidate ranking model on a city level location tag match task. Experiment results show that the SVM greatly outperforms the baseline.
富含当地信息的地理知识资源或地名词典有可能增加信息检索的地理准确性。我们已经在众包的OpenStreetMap和维基百科地理标签(包括地理坐标)中确定了新颖的地方地名词典条目的来源。我们使用机器学习(SVM)创建了一个模糊匹配算法,该算法检查近似拼写和近似地理编码,以便在众包标签和地名词典之间找到重复的内容,以吸收那些新颖的标签。对于每个众包标签,我们的算法从地名词典中生成候选匹配项,然后根据每个标签和地名词典候选项之间的词形或地理关系对候选项进行排名。我们将候选排序的编辑距离基线与svm训练的候选排序模型在城市级别位置标记匹配任务上进行了比较。实验结果表明,支持向量机的性能大大优于基线。
{"title":"Automatic gazetteer enrichment with user-geocoded data","authors":"J. Gelernter, Gautam Ganesh, Hamsini Krishnakumar, Wei Zhang","doi":"10.1145/2534732.2534736","DOIUrl":"https://doi.org/10.1145/2534732.2534736","url":null,"abstract":"Geographical knowledge resources or gazetteers that are enriched with local information have the potential to add geographic precision to information retrieval. We have identified sources of novel local gazetteer entries in crowd-sourced OpenStreetMap and Wikimapia geotags that include geo-coordinates. We created a fuzzy match algorithm using machine learning (SVM) that checks both for approximate spelling and approximate geocoding in order to find duplicates between the crowd-sourced tags and the gazetteer in effort to absorb those tags that are novel. For each crowd-sourced tag, our algorithm generates candidate matches from the gazetteer and then ranks those candidates based on word form or geographical relations between each tag and gazetteer candidate. We compared a baseline of edit distance for candidate ranking to an SVM-trained candidate ranking model on a city level location tag match task. Experiment results show that the SVM greatly outperforms the baseline.","PeriodicalId":314116,"journal":{"name":"GEOCROWD '13","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123416098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
On quantifying qualitative geospatial data: a probabilistic approach 定量定性地理空间数据:一种概率方法
Pub Date : 2013-11-05 DOI: 10.1145/2534732.2534742
Georgios Skoumas, D. Pfoser, Anastasios Kyrillidis
Living in the era of data deluge, we have witnessed a web content explosion, largely due to the massive availability of User-Generated Content (UGC). In this work, we specifically consider the problem of geospatial information extraction and representation, where one can exploit diverse sources of information (such as image and audio data, text data, etc), going beyond traditional volunteered geographic information. Our ambition is to include available narrative information in an effort to better explain geospatial relationships: with spatial reasoning being a basic form of human cognition, narratives expressing such experiences typically contain qualitative spatial data, i.e., spatial objects and spatial relationships. To this end, we formulate a quantitative approach for the representation of qualitative spatial relations extracted from UGC in the form of texts. The proposed method quantifies such relations based on multiple text observations. Such observations provide distance and orientation features which are utilized by a greedy Expectation Maximization-based (EM) algorithm to infer a probability distribution over predefined spatial relationships; the latter represent the quantified relationships under user-defined probabilistic assumptions. We evaluate the applicability and quality of the proposed approach using real UGC data originating from an actual travel blog text corpus. To verify the quality of the result, we generate grid-based "maps" visualizing the spatial extent of the various relations.
生活在数据泛滥的时代,我们见证了网络内容的爆炸式增长,这主要是由于用户生成内容(UGC)的大量可用性。在这项工作中,我们特别考虑了地理空间信息提取和表示的问题,人们可以利用不同的信息源(如图像和音频数据,文本数据等),超越传统的志愿地理信息。我们的目标是包括可用的叙事信息,以更好地解释地理空间关系:由于空间推理是人类认知的基本形式,表达此类经验的叙事通常包含定性空间数据,即空间对象和空间关系。为此,我们制定了一种定量的方法来表示从UGC中提取的文本形式的定性空间关系。提出的方法基于多个文本观测来量化这种关系。这些观测提供了距离和方向特征,这些特征被基于贪婪期望最大化(EM)的算法用来推断预定义空间关系上的概率分布;后者表示在用户定义的概率假设下的量化关系。我们使用来自实际旅游博客文本语料库的真实UGC数据来评估所提议方法的适用性和质量。为了验证结果的质量,我们生成了基于网格的“地图”,将各种关系的空间范围可视化。
{"title":"On quantifying qualitative geospatial data: a probabilistic approach","authors":"Georgios Skoumas, D. Pfoser, Anastasios Kyrillidis","doi":"10.1145/2534732.2534742","DOIUrl":"https://doi.org/10.1145/2534732.2534742","url":null,"abstract":"Living in the era of data deluge, we have witnessed a web content explosion, largely due to the massive availability of User-Generated Content (UGC). In this work, we specifically consider the problem of geospatial information extraction and representation, where one can exploit diverse sources of information (such as image and audio data, text data, etc), going beyond traditional volunteered geographic information. Our ambition is to include available narrative information in an effort to better explain geospatial relationships: with spatial reasoning being a basic form of human cognition, narratives expressing such experiences typically contain qualitative spatial data, i.e., spatial objects and spatial relationships.\u0000 To this end, we formulate a quantitative approach for the representation of qualitative spatial relations extracted from UGC in the form of texts. The proposed method quantifies such relations based on multiple text observations. Such observations provide distance and orientation features which are utilized by a greedy Expectation Maximization-based (EM) algorithm to infer a probability distribution over predefined spatial relationships; the latter represent the quantified relationships under user-defined probabilistic assumptions. We evaluate the applicability and quality of the proposed approach using real UGC data originating from an actual travel blog text corpus. To verify the quality of the result, we generate grid-based \"maps\" visualizing the spatial extent of the various relations.","PeriodicalId":314116,"journal":{"name":"GEOCROWD '13","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122748557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Exploratory analysis of OpenStreetMap for land use classification OpenStreetMap用于土地利用分类的探索性分析
Pub Date : 2013-11-05 DOI: 10.1145/2534732.2534734
J. Estima, M. Painho
In the last years, volunteers have been contributing massively to what we know nowadays as Volunteered Geographic Information. This huge amount of data might be hiding a vast geographical richness and therefore research needs to be conducted to explore their potential and use it in the solution of real world problems. In this study we conduct an exploratory analysis of data from the OpenStreetMap initiative. Using the Corine Land Cover database as reference and continental Portugal as the study area, we establish a possible correspondence between both classification nomenclatures, evaluate the quality of OpenStreetMap polygon features classification against Corine Land Cover classes from level 1 nomenclature, and analyze the spatial distribution of OpenStreetMap classes over continental Portugal. A global classification accuracy around 76% and interesting coverage areas' values are remarkable and promising results that encourages us for future research on this topic.
在过去的几年里,志愿者们为我们今天所知的“志愿地理信息”做出了巨大贡献。这些庞大的数据可能隐藏着巨大的地理丰富性,因此需要进行研究以探索它们的潜力,并将其用于解决现实世界的问题。在这项研究中,我们对来自OpenStreetMap计划的数据进行了探索性分析。以Corine土地覆盖数据库为参考,以葡萄牙大陆为研究区域,建立了两种分类命名法之间可能的对应关系,并对OpenStreetMap多边形特征分类的质量与Corine土地覆盖分类的一级命名法进行了比较,分析了OpenStreetMap分类在葡萄牙大陆的空间分布。76%左右的全球分类准确率和有趣的覆盖区域值是令人瞩目和有希望的结果,这鼓励了我们对该主题的未来研究。
{"title":"Exploratory analysis of OpenStreetMap for land use classification","authors":"J. Estima, M. Painho","doi":"10.1145/2534732.2534734","DOIUrl":"https://doi.org/10.1145/2534732.2534734","url":null,"abstract":"In the last years, volunteers have been contributing massively to what we know nowadays as Volunteered Geographic Information. This huge amount of data might be hiding a vast geographical richness and therefore research needs to be conducted to explore their potential and use it in the solution of real world problems. In this study we conduct an exploratory analysis of data from the OpenStreetMap initiative. Using the Corine Land Cover database as reference and continental Portugal as the study area, we establish a possible correspondence between both classification nomenclatures, evaluate the quality of OpenStreetMap polygon features classification against Corine Land Cover classes from level 1 nomenclature, and analyze the spatial distribution of OpenStreetMap classes over continental Portugal. A global classification accuracy around 76% and interesting coverage areas' values are remarkable and promising results that encourages us for future research on this topic.","PeriodicalId":314116,"journal":{"name":"GEOCROWD '13","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129651534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 79
A mobile sensor data acquisition and evaluation framework for crowd sourcing data 一个移动传感器数据采集和评估框架,用于众包数据
Pub Date : 2013-11-05 DOI: 10.1145/2534732.2534740
Nicolas Billen, Johannes Lauer, A. Zipf
Nowadays mobile phones and especially smart phones are common technical communication devices. Most of them are equipped with a huge number of sensors that detect environment and user interaction. The processing of the sensor measurements is a big challenge as the data is very heterogeneous and frequently available. Usage of these low cost sensors and combining them with other data sources is one of the most promising tasks and will speed up the crowd sourced (geo-)data in the future. This vision will only become reality if we are able to establish techniques to integrate different devices and test many varied situations. In this work, we present a framework for storing, fusioning and processing of mobile smartphone sensor data. Further we give an outlook on possible applications and our future work.
如今,手机,尤其是智能手机是常见的技术通信设备。它们中的大多数都配备了大量的传感器来检测环境和用户交互。传感器测量数据的处理是一个很大的挑战,因为数据非常异构和频繁可用。使用这些低成本传感器并将其与其他数据源相结合是最有前途的任务之一,并将在未来加快众包(地理)数据的速度。只有当我们能够建立技术来集成不同的设备并测试许多不同的情况时,这一愿景才会成为现实。在这项工作中,我们提出了一个存储、融合和处理移动智能手机传感器数据的框架。并对其应用前景和今后的工作进行了展望。
{"title":"A mobile sensor data acquisition and evaluation framework for crowd sourcing data","authors":"Nicolas Billen, Johannes Lauer, A. Zipf","doi":"10.1145/2534732.2534740","DOIUrl":"https://doi.org/10.1145/2534732.2534740","url":null,"abstract":"Nowadays mobile phones and especially smart phones are common technical communication devices. Most of them are equipped with a huge number of sensors that detect environment and user interaction. The processing of the sensor measurements is a big challenge as the data is very heterogeneous and frequently available. Usage of these low cost sensors and combining them with other data sources is one of the most promising tasks and will speed up the crowd sourced (geo-)data in the future. This vision will only become reality if we are able to establish techniques to integrate different devices and test many varied situations. In this work, we present a framework for storing, fusioning and processing of mobile smartphone sensor data. Further we give an outlook on possible applications and our future work.","PeriodicalId":314116,"journal":{"name":"GEOCROWD '13","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133264493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
GEOCROWD '13
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1