Spatio-temporal provenance: Identifying location information from unstructured text

Kisung Lee, R. Ganti, M. Srivatsa, P. Mohapatra
{"title":"Spatio-temporal provenance: Identifying location information from unstructured text","authors":"Kisung Lee, R. Ganti, M. Srivatsa, P. Mohapatra","doi":"10.1109/PerComW.2013.6529548","DOIUrl":null,"url":null,"abstract":"Spatio-temporal attributes represent two aspects of physical presence - space and time - which are integral to human activities. Space-time markers of an entity in conjunction with correlation with other networks such as movements in social network, the road/transportation network encodes a wealth of provenance information. With the advent of mobile computing and cheap and improved location estimation techniques, encoding such information has become commonplace. In this paper, we will focus on deriving such location provenance information from unstructured text generated by social media. As social media such as Facebook and Twitter are integrated with mobile devices, information generated by individuals in these networks gets tagged with spatial markers. We can classify such markers into explicit and implicit tags, where explicit tags encode the spatial data explicitly by providing the accurate location attributes. On the other hand, a lot of social network data may not encode such information explicitly. Our hypothesis in this paper is that the unstructured textual data contains implicit spatial markers at a fine granularity. We develop algorithms to support this hypothesis and evaluate these algorithms on data from FourSquare to show that the spatial category information can be identified with an accuracy of over 80%.","PeriodicalId":101502,"journal":{"name":"2013 IEEE International Conference on Pervasive Computing and Communications Workshops (PERCOM Workshops)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"28","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE International Conference on Pervasive Computing and Communications Workshops (PERCOM Workshops)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PerComW.2013.6529548","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 28

Abstract

Spatio-temporal attributes represent two aspects of physical presence - space and time - which are integral to human activities. Space-time markers of an entity in conjunction with correlation with other networks such as movements in social network, the road/transportation network encodes a wealth of provenance information. With the advent of mobile computing and cheap and improved location estimation techniques, encoding such information has become commonplace. In this paper, we will focus on deriving such location provenance information from unstructured text generated by social media. As social media such as Facebook and Twitter are integrated with mobile devices, information generated by individuals in these networks gets tagged with spatial markers. We can classify such markers into explicit and implicit tags, where explicit tags encode the spatial data explicitly by providing the accurate location attributes. On the other hand, a lot of social network data may not encode such information explicitly. Our hypothesis in this paper is that the unstructured textual data contains implicit spatial markers at a fine granularity. We develop algorithms to support this hypothesis and evaluate these algorithms on data from FourSquare to show that the spatial category information can be identified with an accuracy of over 80%.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
时空来源:从非结构化文本中识别位置信息
时空属性代表了物理存在的两个方面——空间和时间——它们是人类活动不可或缺的一部分。一个实体的时空标记与其他网络(如社会网络中的运动)的相关性相结合,道路/交通网络编码了丰富的来源信息。随着移动计算的出现以及廉价和改进的位置估计技术的出现,对这些信息进行编码已经变得司空见惯。在本文中,我们将专注于从社交媒体生成的非结构化文本中获取此类位置来源信息。随着Facebook和Twitter等社交媒体与移动设备的整合,这些网络中个人产生的信息被贴上了空间标记。我们可以将这些标记分为显式标记和隐式标记,其中显式标记通过提供准确的位置属性显式地对空间数据进行编码。另一方面,许多社交网络数据可能没有明确地对这些信息进行编码。我们在本文中的假设是,非结构化文本数据在细粒度上包含隐式空间标记。我们开发了算法来支持这一假设,并在FourSquare的数据上对这些算法进行了评估,结果表明空间类别信息的识别准确率超过80%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A reconfigurable distributed CEP middleware for diverse mobility scenarios TinyBox: Social, local, mobile content sharing PIggy-backed key exchange using online services (PIKE) Towards context-aware internet services with unselfish clients Recommendations-based location privacy control
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1