Happy or not: Generating topic-based emotional heatmaps for Culturomics using CyberGIS

Eric Shook, Kalev H. Leetaru, G. Cao, Anand Padmanabhan, Shaowen Wang
{"title":"Happy or not: Generating topic-based emotional heatmaps for Culturomics using CyberGIS","authors":"Eric Shook, Kalev H. Leetaru, G. Cao, Anand Padmanabhan, Shaowen Wang","doi":"10.1109/ESCIENCE.2012.6404440","DOIUrl":null,"url":null,"abstract":"The field of Culturomics exploits “big data” to explore human society at population scale. Culturomics increasingly needs to consider geographic contexts and, thus, this research develops a geospatial visual analytical approach that transforms vast amounts of textual data into emotional heatmaps with fine-grained spatial resolution. Fulltext geocoding and sentiment mining extract locations and latent “tone” from text-based data, which are combined with spatial analysis methods - kernel density estimation and spatial interpolation - to generate heatmaps that capture the interplay of location, topic, and tone toward narrative impacts. To demonstrate the effectiveness of the approach, the complete English edition of Wikipedia is processed using a supercomputer to extract all locations and tone associated with the year of 2003. An emotional heatmap of Wikipedia's discussion of “armed conflict” for that year is created using the spatial analysis methods. Unlike previous research, our approach is designed for exploratory spatial analysis of topics in text archives by incorporating multiple attributes including the prominence of each location mentioned in the text, the density of a topic at each location compared to other topics, and the tone of the topics of interest into a single analysis. The generation of such fine-grained emotional heatmaps is computationally intensive particularly when accounting for the multiple attributes at fine scales. Therefore a CyberGIS platform based on national cyberinfrastructure in the United States is used to enable the computationally intensive visual analytics.","PeriodicalId":6364,"journal":{"name":"2012 IEEE 8th International Conference on E-Science","volume":"5 1","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"32","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE 8th International Conference on E-Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ESCIENCE.2012.6404440","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 32

Abstract

The field of Culturomics exploits “big data” to explore human society at population scale. Culturomics increasingly needs to consider geographic contexts and, thus, this research develops a geospatial visual analytical approach that transforms vast amounts of textual data into emotional heatmaps with fine-grained spatial resolution. Fulltext geocoding and sentiment mining extract locations and latent “tone” from text-based data, which are combined with spatial analysis methods - kernel density estimation and spatial interpolation - to generate heatmaps that capture the interplay of location, topic, and tone toward narrative impacts. To demonstrate the effectiveness of the approach, the complete English edition of Wikipedia is processed using a supercomputer to extract all locations and tone associated with the year of 2003. An emotional heatmap of Wikipedia's discussion of “armed conflict” for that year is created using the spatial analysis methods. Unlike previous research, our approach is designed for exploratory spatial analysis of topics in text archives by incorporating multiple attributes including the prominence of each location mentioned in the text, the density of a topic at each location compared to other topics, and the tone of the topics of interest into a single analysis. The generation of such fine-grained emotional heatmaps is computationally intensive particularly when accounting for the multiple attributes at fine scales. Therefore a CyberGIS platform based on national cyberinfrastructure in the United States is used to enable the computationally intensive visual analytics.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
快乐与否:使用CyberGIS为文化组生成基于主题的情感热图
文化组学领域利用“大数据”在人口规模上探索人类社会。文化组学越来越需要考虑地理背景,因此,本研究开发了一种地理空间视觉分析方法,将大量文本数据转换为具有细粒度空间分辨率的情感热图。全文地理编码和情感挖掘从基于文本的数据中提取位置和潜在的“基调”,这些数据与空间分析方法(核密度估计和空间插值)相结合,生成热图,捕捉位置、主题和基调对叙事影响的相互作用。为了证明这种方法的有效性,用一台超级计算机对维基百科的完整英文版进行处理,提取出与2003年相关的所有位置和音调。使用空间分析方法创建了当年维基百科关于“武装冲突”的讨论的情感热图。与之前的研究不同,我们的方法旨在通过将多个属性(包括文本中提到的每个位置的突出性,每个位置的主题密度与其他主题相比,以及感兴趣的主题的基调)纳入单个分析,对文本档案中的主题进行探索性空间分析。生成这种细粒度的情感热图需要大量的计算,特别是在细尺度上考虑多个属性时。因此,基于美国国家网络基础设施的CyberGIS平台被用于实现计算密集型视觉分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Scientific Workflow Interchanging through Patterns: Reversals and Lessons Learned Shape Analysis Using the Spectral Graph Wavelet Transform Provenance analysis: Towards quality provenance Fast confidential search for bio-medical data using Bloom filters and Homomorphic Cryptography Calibration of watershed models using cloud computing
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1