首页 > 最新文献

Proceedings of the 10th Workshop on Geographic Information Retrieval最新文献

英文 中文
Evaluation of NER systems for the recognition of place mentions in French thematic corpora 法语专题语料库中地点提及识别的NER系统评价
Pub Date : 2016-10-31 DOI: 10.1145/3003464.3003471
Carmen Brando, Catherine Dominguès, Magali Capeyron
Ongoing initiatives promoted by cultural institutions and public administrations engage in the development of textual corpora issued from the general public. In this work, we deal with a spoken corpus of life stories and a crowd-sourced Web corpus of people's contributions related to urban planning issues in their city. Located information constitutes an essential component in these corpora. Toponyms refer to official names (e.g. Congo) which are listed in gazetteers but often to generic locations such as un endroit très beau (a beautiful place). Because of the nature of the corpora, these generic locations are inherently subjective, vague and descriptive. For enabling automated exploitation of these texts, it is crucial to properly detect such kinds of place mentions. In this sense, the present work provides a comparative study of state-of-art NER1 systems, most importantly of supervised tools such as Stanford NER, for the identification of generic locations in thematic corpora.
文化机构和公共行政部门正在推动的举措是开发公众发布的文本语料库。在这项工作中,我们处理了生活故事的口语语料库和人们对其城市规划问题的贡献的众包网络语料库。定位信息是这些语料库的重要组成部分。地名指的是在地名词典中列出的官方名称(如刚果),但通常指的是一般的地点,如unendroit tr s beau(一个美丽的地方)。由于语料库的性质,这些通用位置本质上是主观的、模糊的和描述性的。为了实现对这些文本的自动利用,正确检测这类地点提及是至关重要的。从这个意义上说,本研究提供了对最先进的NER1系统的比较研究,最重要的是斯坦福NER等监督工具,用于识别主题语料库中的通用位置。
{"title":"Evaluation of NER systems for the recognition of place mentions in French thematic corpora","authors":"Carmen Brando, Catherine Dominguès, Magali Capeyron","doi":"10.1145/3003464.3003471","DOIUrl":"https://doi.org/10.1145/3003464.3003471","url":null,"abstract":"Ongoing initiatives promoted by cultural institutions and public administrations engage in the development of textual corpora issued from the general public. In this work, we deal with a spoken corpus of life stories and a crowd-sourced Web corpus of people's contributions related to urban planning issues in their city. Located information constitutes an essential component in these corpora. Toponyms refer to official names (e.g. Congo) which are listed in gazetteers but often to generic locations such as un endroit très beau (a beautiful place). Because of the nature of the corpora, these generic locations are inherently subjective, vague and descriptive. For enabling automated exploitation of these texts, it is crucial to properly detect such kinds of place mentions. In this sense, the present work provides a comparative study of state-of-art NER1 systems, most importantly of supervised tools such as Stanford NER, for the identification of generic locations in thematic corpora.","PeriodicalId":308638,"journal":{"name":"Proceedings of the 10th Workshop on Geographic Information Retrieval","volume":"21 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132267279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Towards geo-referencing infrastructure for local news 为本地新闻提供地理参考基础设施
Pub Date : 2016-10-31 DOI: 10.1145/3003464.3003473
Guoray Cai, Ye Tian
Local news articles are an important source of knowledge about local events, place-specific culture, and peoples' thoughts about their environment. Reliable geocoding of such articles is the first step towards unlocking such local knowledge for community engagement and development. However, existing geo-referencing methods and tools do not work well for local news because they do not reflect the ways local people encode and communicate geographical knowledge. This paper argues that local news requires a different method and infrastructure support for effective geo-referencing. To gain insights on the unique aspects of local gazetteers and the nature of ambiguities, we present an analysis of a collection of local new articles. We found that place references in local news have their special vocabulary, and that their ambiguities are handled differently by local people. We translated such insights into a gazetteer-based geocoding solution that combines progressive geocoding with a smart footprint recommender. Progressive geocoding service uses Nominatim (OpenStreetMap) as the initial gazetteer to jump-start the construction of local gazetteer for a community and by the community. LocusRecommender automatically suggests the best matches from gazetteer ranked by a set of heuristic rules. Preliminary evaluation shows that our smart footprint recommender predicts 80% of the answers by its top-three recommendations.
当地新闻文章是了解当地事件、当地特定文化以及人们对环境的看法的重要来源。对此类文章进行可靠的地理编码,是为社区参与和发展解锁此类本地知识的第一步。然而,现有的地理参考方法和工具并不适用于地方新闻,因为它们没有反映当地人对地理知识进行编码和交流的方式。本文认为,地方新闻需要一种不同的方法和基础设施来支持有效的地理参考。为了深入了解地方地名词典的独特方面和歧义的性质,我们对当地新文章的集合进行了分析。我们发现,地方新闻中的地方引用有其特殊的词汇,其歧义的处理方式也与当地人不同。我们将这些见解转化为基于地名词典的地理编码解决方案,该解决方案将渐进式地理编码与智能足迹推荐相结合。渐进式地理编码服务使用Nominatim (OpenStreetMap)作为初始地名词典,为社区和社区启动地方地名词典的建设。LocusRecommender根据一组启发式规则自动从地名词典中推荐最佳匹配。初步评估显示,我们的智能足迹推荐程序通过前三名的推荐预测了80%的答案。
{"title":"Towards geo-referencing infrastructure for local news","authors":"Guoray Cai, Ye Tian","doi":"10.1145/3003464.3003473","DOIUrl":"https://doi.org/10.1145/3003464.3003473","url":null,"abstract":"Local news articles are an important source of knowledge about local events, place-specific culture, and peoples' thoughts about their environment. Reliable geocoding of such articles is the first step towards unlocking such local knowledge for community engagement and development. However, existing geo-referencing methods and tools do not work well for local news because they do not reflect the ways local people encode and communicate geographical knowledge. This paper argues that local news requires a different method and infrastructure support for effective geo-referencing. To gain insights on the unique aspects of local gazetteers and the nature of ambiguities, we present an analysis of a collection of local new articles. We found that place references in local news have their special vocabulary, and that their ambiguities are handled differently by local people. We translated such insights into a gazetteer-based geocoding solution that combines progressive geocoding with a smart footprint recommender. Progressive geocoding service uses Nominatim (OpenStreetMap) as the initial gazetteer to jump-start the construction of local gazetteer for a community and by the community. LocusRecommender automatically suggests the best matches from gazetteer ranked by a set of heuristic rules. Preliminary evaluation shows that our smart footprint recommender predicts 80% of the answers by its top-three recommendations.","PeriodicalId":308638,"journal":{"name":"Proceedings of the 10th Workshop on Geographic Information Retrieval","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126406191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Performance evaluation measures for toponym resolution 地名解析性能评价方法
Pub Date : 2016-10-31 DOI: 10.1145/3003464.3003472
M. Karimzadeh
In this paper, we point out to the shortcomings of precision and recall in evaluating the performance of geoparsing algorithms. We propose separate processes for evaluating toponym recognition and toponym resolution stages, and also propose new metrics that quantify the performance of toponym resolution.
本文指出了地球解析算法在精度和召回率方面存在的不足。我们提出了评估地名识别和地名解析阶段的单独过程,并提出了量化地名解析性能的新指标。
{"title":"Performance evaluation measures for toponym resolution","authors":"M. Karimzadeh","doi":"10.1145/3003464.3003472","DOIUrl":"https://doi.org/10.1145/3003464.3003472","url":null,"abstract":"In this paper, we point out to the shortcomings of precision and recall in evaluating the performance of geoparsing algorithms. We propose separate processes for evaluating toponym recognition and toponym resolution stages, and also propose new metrics that quantify the performance of toponym resolution.","PeriodicalId":308638,"journal":{"name":"Proceedings of the 10th Workshop on Geographic Information Retrieval","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122838122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Extracting spatial information from social media in support of agricultural management decisions 从社交媒体中提取空间信息,支持农业管理决策
Pub Date : 2016-10-31 DOI: 10.1145/3003464.3003468
Nevena Golubovic, C. Krintz, R. Wolski, Sara Lafia, T. Hervey, W. Kuhn
Farmers face pressure to respond to unpredictable weather, the spread of pests, and other variable events on their farms. This paper proposes a framework for data aggregation from diverse sources that extracts named places impacted by events relevant to agricultural practices. Our vision is to couple natural language processing, geocoding, and existing geographic information retrieval techniques to increase the value of already-available data through aggregation, filtering, validation, and notifications, helping farmers make timely and informed decisions with greater ease.
农民面临着应对不可预测的天气、害虫的蔓延和农场其他可变事件的压力。本文提出了一个从不同来源提取受农业实践相关事件影响的指定地点的数据汇总框架。我们的愿景是将自然语言处理、地理编码和现有地理信息检索技术结合起来,通过聚合、过滤、验证和通知来增加现有数据的价值,帮助农民更轻松地做出及时和明智的决策。
{"title":"Extracting spatial information from social media in support of agricultural management decisions","authors":"Nevena Golubovic, C. Krintz, R. Wolski, Sara Lafia, T. Hervey, W. Kuhn","doi":"10.1145/3003464.3003468","DOIUrl":"https://doi.org/10.1145/3003464.3003468","url":null,"abstract":"Farmers face pressure to respond to unpredictable weather, the spread of pests, and other variable events on their farms. This paper proposes a framework for data aggregation from diverse sources that extracts named places impacted by events relevant to agricultural practices. Our vision is to couple natural language processing, geocoding, and existing geographic information retrieval techniques to increase the value of already-available data through aggregation, filtering, validation, and notifications, helping farmers make timely and informed decisions with greater ease.","PeriodicalId":308638,"journal":{"name":"Proceedings of the 10th Workshop on Geographic Information Retrieval","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126817152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Semantic enrichment of places with VGI sources: a knowledge based approach 使用VGI源对位置进行语义丰富:基于知识的方法
Pub Date : 2016-10-31 DOI: 10.1145/3003464.3003470
Camille Tardy, G. Falquet, L. Moccozet
We propose a categorization algorithm for text content description such as tags for images from social media or crowd sourcing services, to identify places characteristics. The algorithm is based on a spatial coverage and a multi-facets categorization. We describe how it can be applied to individually process images from Flickr in order to extract geo-spatial knowledge. It is particularly dedicated for places with a small number of photos. The extraction process is done using categorization rules based on geographic and terminological knowledge resources.
我们提出了一种文本内容描述的分类算法,例如来自社交媒体或众包服务的图像标签,以识别地点特征。该算法基于空间覆盖和多面分类。我们描述了如何将其应用于单独处理来自Flickr的图像以提取地理空间知识。它特别适用于照片数量很少的地方。提取过程使用基于地理和术语知识资源的分类规则完成。
{"title":"Semantic enrichment of places with VGI sources: a knowledge based approach","authors":"Camille Tardy, G. Falquet, L. Moccozet","doi":"10.1145/3003464.3003470","DOIUrl":"https://doi.org/10.1145/3003464.3003470","url":null,"abstract":"We propose a categorization algorithm for text content description such as tags for images from social media or crowd sourcing services, to identify places characteristics. The algorithm is based on a spatial coverage and a multi-facets categorization. We describe how it can be applied to individually process images from Flickr in order to extract geo-spatial knowledge. It is particularly dedicated for places with a small number of photos. The extraction process is done using categorization rules based on geographic and terminological knowledge resources.","PeriodicalId":308638,"journal":{"name":"Proceedings of the 10th Workshop on Geographic Information Retrieval","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130619247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A holistic framework of geographical semantic web aligning 地理语义网对齐的整体框架
Pub Date : 2016-10-31 DOI: 10.1145/3003464.3003465
Li Yu, Xiliang Liu, Mingxiao Li, Peng Peng, F. Lu
Semantic aligning of heterogeneous geographical data from different sources behaves unsatisfactory on Geographical Semantic Web (GSW) due to the flat structure of GSW and the influence of spatial features. To solve this problem, this paper proposes a holistic framework for GSW aligning. This holistic framework firstly produces the initial matched results respectively for classes, properties and instances by the approval voting strategy, and then enhances these results by the mutual cooperating mechanism. Especially, spatial distance and spatial index are introduced to align instances and to improve the performance of aligning class and aligning property. To demonstrate its ability, this holistic framework is tested with two real GSWs. Compared with the state-of-the-art holistic alignment system, namely PARIS, this framework gains a large number of matched pairs. The Fl values of aligning class, aligning property and aligning instance respectively are 0.562, 0.545 and 0.646, all of which are higher than PARIS's.
由于地理语义网的扁平结构和空间特征的影响,不同来源的异构地理数据在地理语义网上的语义对齐效果不理想。为了解决这一问题,本文提出了一个GSW对齐的整体框架。该整体框架首先通过批准投票策略对类、属性和实例分别产生初始匹配结果,然后通过相互协作机制对这些结果进行增强。特别地,引入空间距离和空间索引来对齐实例,提高了对齐类和对齐属性的性能。为了证明它的能力,我们用两个真实的gsw对这个整体框架进行了测试。与目前最先进的整体对准系统PARIS相比,该框架获得了大量的匹配对。对准类、对准属性和对准实例的Fl值分别为0.562、0.545和0.646,均高于PARIS。
{"title":"A holistic framework of geographical semantic web aligning","authors":"Li Yu, Xiliang Liu, Mingxiao Li, Peng Peng, F. Lu","doi":"10.1145/3003464.3003465","DOIUrl":"https://doi.org/10.1145/3003464.3003465","url":null,"abstract":"Semantic aligning of heterogeneous geographical data from different sources behaves unsatisfactory on Geographical Semantic Web (GSW) due to the flat structure of GSW and the influence of spatial features. To solve this problem, this paper proposes a holistic framework for GSW aligning. This holistic framework firstly produces the initial matched results respectively for classes, properties and instances by the approval voting strategy, and then enhances these results by the mutual cooperating mechanism. Especially, spatial distance and spatial index are introduced to align instances and to improve the performance of aligning class and aligning property. To demonstrate its ability, this holistic framework is tested with two real GSWs. Compared with the state-of-the-art holistic alignment system, namely PARIS, this framework gains a large number of matched pairs. The Fl values of aligning class, aligning property and aligning instance respectively are 0.562, 0.545 and 0.646, all of which are higher than PARIS's.","PeriodicalId":308638,"journal":{"name":"Proceedings of the 10th Workshop on Geographic Information Retrieval","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134225462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A depth-first branch-and-bound algorithm for geocoding historic itinerary tables 历史行程表地理编码的深度优先分支定界算法
Pub Date : 2016-10-31 DOI: 10.1145/3003464.3003467
Daniel Blank, A. Henrich
The work in this paper is motivated from two different perspectives: First, gazetteers as an important data source for Geographic Information Retrieval (GIR) applications often lack historic place name information. More focused historic gazetteers are a far cry from being complete and often specialize only on certain geographic regions or time periods. Second, research on historic route descriptions---so called itineraries---is an important task in many research disciplines such as geography, linguistics, history, religion, or even medicine. This research on historic itineraries is characterized by manual, time-consuming work with only minimalistic IT support through gazetteers and map services. We address both perspectives and present a depth-first branch-and-bound (DFBnB) algorithm for deducing historic place names and thus the stops of ancient travel routes from itinerary tables. Multiple phonetic and character-based string distances are evaluated when resolving parts of an itinerary first published in 1563.
本文的研究有两个方面的动机:首先,地名词典作为地理信息检索(GIR)应用的重要数据源,往往缺乏历史地名信息。更集中的历史地名辞典远远不够完整,往往只专注于某些地理区域或时间段。其次,对历史路线描述的研究——即所谓的行程——是地理、语言学、历史、宗教甚至医学等许多研究学科的重要任务。这项历史路线研究的特点是手工、耗时的工作,只有极简的信息技术支持,通过地名词典和地图服务。我们解决了这两种观点,并提出了一种深度优先的分支边界(DFBnB)算法,用于从行程表中推断历史地名,从而推断古代旅行路线的站点。在解析1563年首次发布的行程部分时,会评估多个语音和基于字符的字符串距离。
{"title":"A depth-first branch-and-bound algorithm for geocoding historic itinerary tables","authors":"Daniel Blank, A. Henrich","doi":"10.1145/3003464.3003467","DOIUrl":"https://doi.org/10.1145/3003464.3003467","url":null,"abstract":"The work in this paper is motivated from two different perspectives: First, gazetteers as an important data source for Geographic Information Retrieval (GIR) applications often lack historic place name information. More focused historic gazetteers are a far cry from being complete and often specialize only on certain geographic regions or time periods. Second, research on historic route descriptions---so called itineraries---is an important task in many research disciplines such as geography, linguistics, history, religion, or even medicine. This research on historic itineraries is characterized by manual, time-consuming work with only minimalistic IT support through gazetteers and map services. We address both perspectives and present a depth-first branch-and-bound (DFBnB) algorithm for deducing historic place names and thus the stops of ancient travel routes from itinerary tables. Multiple phonetic and character-based string distances are evaluated when resolving parts of an itinerary first published in 1563.","PeriodicalId":308638,"journal":{"name":"Proceedings of the 10th Workshop on Geographic Information Retrieval","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116608818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Facility detection and popularity assessment from text classification of social media and crowdsourced data 基于社交媒体文本分类和众包数据的设施检测和人气评估
Pub Date : 2016-10-31 DOI: 10.1145/3003464.3003466
Kevin A. Sparks, Roger G. Li, Gautam Thakur, R. Stewart, M. Urban
Advances in technology have continually progressed our understanding of where people are, how they use the environment around them, and why they are at their current location. Having a better knowledge of when various locations become popular through space and time could have large impacts on research fields like urban dynamics and energy consumption. In this paper, we discuss the ability to identify and locate various facility types (e.g. restaurant, airport, stadiums) using social media, and assess methods in determining when these facilities become popular over time. We use standard natural language processing tools and machine learning classifiers to interpret geotagged Twitter text and determine if a user is seemingly at a location of interest when the tweet was sent. On average our classifiers are approximately 85% accurate varying across multiple facility types, with a peak precision of 98%. By using these standard methods to classify unstructured text, geotagged social media data can be an extremely useful tool to better understanding the composition of places and how and when people use them.
科技的进步使我们对人类在哪里,他们如何利用周围的环境,以及他们为什么在当前位置的理解不断加深。更好地了解不同地点何时在空间和时间上变得流行,可能会对城市动态和能源消耗等研究领域产生重大影响。在本文中,我们讨论了使用社交媒体识别和定位各种设施类型(例如餐厅,机场,体育场)的能力,并评估了确定这些设施何时随着时间的推移而流行的方法。我们使用标准的自然语言处理工具和机器学习分类器来解释地理标记的推特文本,并确定推文发送时用户是否似乎在感兴趣的位置。平均而言,我们的分类器在多个设施类型之间的准确率约为85%,峰值精度为98%。通过使用这些标准方法对非结构化文本进行分类,地理标记的社交媒体数据可以成为一个非常有用的工具,可以更好地理解地点的构成以及人们如何以及何时使用它们。
{"title":"Facility detection and popularity assessment from text classification of social media and crowdsourced data","authors":"Kevin A. Sparks, Roger G. Li, Gautam Thakur, R. Stewart, M. Urban","doi":"10.1145/3003464.3003466","DOIUrl":"https://doi.org/10.1145/3003464.3003466","url":null,"abstract":"Advances in technology have continually progressed our understanding of where people are, how they use the environment around them, and why they are at their current location. Having a better knowledge of when various locations become popular through space and time could have large impacts on research fields like urban dynamics and energy consumption. In this paper, we discuss the ability to identify and locate various facility types (e.g. restaurant, airport, stadiums) using social media, and assess methods in determining when these facilities become popular over time. We use standard natural language processing tools and machine learning classifiers to interpret geotagged Twitter text and determine if a user is seemingly at a location of interest when the tweet was sent. On average our classifiers are approximately 85% accurate varying across multiple facility types, with a peak precision of 98%. By using these standard methods to classify unstructured text, geotagged social media data can be an extremely useful tool to better understanding the composition of places and how and when people use them.","PeriodicalId":308638,"journal":{"name":"Proceedings of the 10th Workshop on Geographic Information Retrieval","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133608790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Refining imprecise spatio-temporal events: a network-based approach 精炼不精确的时空事件:基于网络的方法
Pub Date : 2016-10-31 DOI: 10.1145/3003464.3003469
Andreas Spitz, Johanna Geiß, Michael Gertz, Stefan Hagedorn, K. Sattler
Events as composites of temporal, spatial and actor information are a central object of interest in many information retrieval (IR) scenarios. There are several challenges to such event-centric IR, which range from the detection and extraction of geographic, temporal and actor mentions in documents to the construction of event descriptions as triples of locations, dates, and actors that can support event query scenarios. For the latter challenge, existing approaches fall short when dealing with imprecise event components. For example, if the exact location or date is unknown, existing IR methods are often unaware of different granularity levels and the conceptual proximity of dates or locations. To address these problems, we present a framework that efficiently answers imprecise event queries, whose geographic or temporal component is given only at a coarse granularity level. Our approach utilizes a network-based event model that includes location, date, and actor components that are extracted from large document collections. Instances of entity and event mentions in the network are weighted based on both their frequency of occurrence and textual distance to reflect semantic relatedness. We demonstrate the utility and flexibility of our approach for evaluating imprecise event queries based on a large collection of events extracted from the English Wikipedia for a ground truth of news events.
事件作为时间、空间和参与者信息的组合是许多信息检索(IR)场景中感兴趣的中心对象。这种以事件为中心的IR存在一些挑战,从检测和提取文档中提到的地理、时间和参与者,到将事件描述构建为可以支持事件查询场景的位置、日期和参与者的三元组。对于后一种挑战,现有方法在处理不精确的事件组件时存在不足。例如,如果确切的位置或日期是未知的,现有的IR方法通常不知道不同的粒度级别和日期或位置的概念接近度。为了解决这些问题,我们提出了一个框架,该框架可以有效地回答不精确的事件查询,其地理或时间组件仅在粗粒度级别上给出。我们的方法利用基于网络的事件模型,该模型包括从大型文档集合中提取的位置、日期和参与者组件。网络中实体和事件被提及的实例根据其出现频率和文本距离进行加权,以反映语义相关性。我们展示了我们的方法的实用性和灵活性,该方法基于从英文维基百科中提取的大量事件来评估不精确的事件查询,以获得新闻事件的基本真相。
{"title":"Refining imprecise spatio-temporal events: a network-based approach","authors":"Andreas Spitz, Johanna Geiß, Michael Gertz, Stefan Hagedorn, K. Sattler","doi":"10.1145/3003464.3003469","DOIUrl":"https://doi.org/10.1145/3003464.3003469","url":null,"abstract":"Events as composites of temporal, spatial and actor information are a central object of interest in many information retrieval (IR) scenarios. There are several challenges to such event-centric IR, which range from the detection and extraction of geographic, temporal and actor mentions in documents to the construction of event descriptions as triples of locations, dates, and actors that can support event query scenarios. For the latter challenge, existing approaches fall short when dealing with imprecise event components. For example, if the exact location or date is unknown, existing IR methods are often unaware of different granularity levels and the conceptual proximity of dates or locations. To address these problems, we present a framework that efficiently answers imprecise event queries, whose geographic or temporal component is given only at a coarse granularity level. Our approach utilizes a network-based event model that includes location, date, and actor components that are extracted from large document collections. Instances of entity and event mentions in the network are weighted based on both their frequency of occurrence and textual distance to reflect semantic relatedness. We demonstrate the utility and flexibility of our approach for evaluating imprecise event queries based on a large collection of events extracted from the English Wikipedia for a ground truth of news events.","PeriodicalId":308638,"journal":{"name":"Proceedings of the 10th Workshop on Geographic Information Retrieval","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115711310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Proceedings of the 10th Workshop on Geographic Information Retrieval 第十届地理信息检索研讨会论文集
{"title":"Proceedings of the 10th Workshop on Geographic Information Retrieval","authors":"","doi":"10.1145/3003464","DOIUrl":"https://doi.org/10.1145/3003464","url":null,"abstract":"","PeriodicalId":308638,"journal":{"name":"Proceedings of the 10th Workshop on Geographic Information Retrieval","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115843426","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 10th Workshop on Geographic Information Retrieval
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1