文本挖掘和语义三元组:应用人道主义法医研究中的文本空间分析

Molly Miranker, Alberto Giordano
{"title":"文本挖掘和语义三元组:应用人道主义法医研究中的文本空间分析","authors":"Molly Miranker,&nbsp;Alberto Giordano","doi":"10.1016/j.diggeo.2020.100005","DOIUrl":null,"url":null,"abstract":"<div><p>The methods and tools of Geographic Information Sciences (GIScience)—spatial analysis, spatial statistics, and geographic information technologies—are increasingly being used in forensic humanitarian projects. In this article we explore ways to parse and analyze social and media releases from the United States Customs and Border Patrol (CBP) to gain an understanding of the death of migrants at the Texas-Mexico border. The methods we used include corpus linguistic (CL)/natural language processing (NLP) and Qualitative Spatial Representation (QSR) and Semantic Triples (ST). Our results indicate that CL/NLP and QSR/ST have the potential to increase and improve deceased migrant case identification by providing a framework for searching for key terms or themes throughout multiple textual sources. In the specific case examined, however, CL/NLP showed that CBP social media focused on drug confiscation and general patrolling activities and were of limited use for tabulating incidences of migrant death. On the other hand, QSR/ST visualizations showed which CBP Stations most frequently reported deceased migrant recoveries (i.e., search and collection of human remains) and with whom they collaborated.</p><p>We believe these methods are part of the methodological toolkit needed to lay the ground for what we call Humanitarian GIS—the application of spatial analytical perspectives and tools to genocide studies, spatial forensics, and, in general, human rights topics and events. Within this toolkit, CL/NLP and QSR/ST highlight spatial relationships that are not necessarily mappable in a traditional GIS setting and allow researchers to detect patterns across large corpora of heterogeneous information. The mixing of methodologies and the combination of qualitative and quantitative data in Humanitarian GIS may change our understanding of an ongoing humanitarian crisis and aid in improving and increasing response to such crises.</p></div>","PeriodicalId":100377,"journal":{"name":"Digital Geography and Society","volume":"1 ","pages":"Article 100005"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.diggeo.2020.100005","citationCount":"8","resultStr":"{\"title\":\"Text mining and semantic triples: Spatial analyses of text in applied humanitarian forensic research\",\"authors\":\"Molly Miranker,&nbsp;Alberto Giordano\",\"doi\":\"10.1016/j.diggeo.2020.100005\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The methods and tools of Geographic Information Sciences (GIScience)—spatial analysis, spatial statistics, and geographic information technologies—are increasingly being used in forensic humanitarian projects. In this article we explore ways to parse and analyze social and media releases from the United States Customs and Border Patrol (CBP) to gain an understanding of the death of migrants at the Texas-Mexico border. The methods we used include corpus linguistic (CL)/natural language processing (NLP) and Qualitative Spatial Representation (QSR) and Semantic Triples (ST). Our results indicate that CL/NLP and QSR/ST have the potential to increase and improve deceased migrant case identification by providing a framework for searching for key terms or themes throughout multiple textual sources. In the specific case examined, however, CL/NLP showed that CBP social media focused on drug confiscation and general patrolling activities and were of limited use for tabulating incidences of migrant death. On the other hand, QSR/ST visualizations showed which CBP Stations most frequently reported deceased migrant recoveries (i.e., search and collection of human remains) and with whom they collaborated.</p><p>We believe these methods are part of the methodological toolkit needed to lay the ground for what we call Humanitarian GIS—the application of spatial analytical perspectives and tools to genocide studies, spatial forensics, and, in general, human rights topics and events. Within this toolkit, CL/NLP and QSR/ST highlight spatial relationships that are not necessarily mappable in a traditional GIS setting and allow researchers to detect patterns across large corpora of heterogeneous information. The mixing of methodologies and the combination of qualitative and quantitative data in Humanitarian GIS may change our understanding of an ongoing humanitarian crisis and aid in improving and increasing response to such crises.</p></div>\",\"PeriodicalId\":100377,\"journal\":{\"name\":\"Digital Geography and Society\",\"volume\":\"1 \",\"pages\":\"Article 100005\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1016/j.diggeo.2020.100005\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Geography and Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666378320300052\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Geography and Society","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666378320300052","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

摘要

地理信息科学(GIScience)的方法和工具——空间分析、空间统计和地理信息技术——越来越多地用于法医人道主义项目。在本文中,我们探讨了解析和分析来自美国海关和边境巡逻队(CBP)的社交和媒体发布的方法,以了解德克萨斯州-墨西哥边境移民的死亡情况。我们使用的方法包括语料库语言(CL)/自然语言处理(NLP)和定性空间表征(QSR)和语义三元组(ST)。我们的研究结果表明,通过在多个文本来源中搜索关键术语或主题,CL/NLP和QSR/ST提供了一个框架,有可能增加和改善已故移民病例的识别。然而,在审查的具体案例中,劳工委员会/国家劳工组织表明,海关和边境保护局的社交媒体侧重于毒品没收和一般巡逻活动,在统计移民死亡事件方面作用有限。另一方面,QSR/ST可视化显示了哪些CBP站最常报告死亡移民的发现(即搜索和收集人类遗骸)以及他们与谁合作。我们认为,这些方法是为我们所说的人道主义地理信息系统奠定基础所需的方法论工具包的一部分——将空间分析视角和工具应用于种族灭绝研究、空间取证,以及总体上的人权主题和事件。在这个工具包中,CL/NLP和QSR/ST强调了在传统GIS设置中不一定可以映射的空间关系,并允许研究人员在大型异构信息语料库中检测模式。人道主义地理信息系统中方法的混合以及定性和定量数据的结合可能会改变我们对正在进行的人道主义危机的理解,并有助于改善和增加对此类危机的反应。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Text mining and semantic triples: Spatial analyses of text in applied humanitarian forensic research

The methods and tools of Geographic Information Sciences (GIScience)—spatial analysis, spatial statistics, and geographic information technologies—are increasingly being used in forensic humanitarian projects. In this article we explore ways to parse and analyze social and media releases from the United States Customs and Border Patrol (CBP) to gain an understanding of the death of migrants at the Texas-Mexico border. The methods we used include corpus linguistic (CL)/natural language processing (NLP) and Qualitative Spatial Representation (QSR) and Semantic Triples (ST). Our results indicate that CL/NLP and QSR/ST have the potential to increase and improve deceased migrant case identification by providing a framework for searching for key terms or themes throughout multiple textual sources. In the specific case examined, however, CL/NLP showed that CBP social media focused on drug confiscation and general patrolling activities and were of limited use for tabulating incidences of migrant death. On the other hand, QSR/ST visualizations showed which CBP Stations most frequently reported deceased migrant recoveries (i.e., search and collection of human remains) and with whom they collaborated.

We believe these methods are part of the methodological toolkit needed to lay the ground for what we call Humanitarian GIS—the application of spatial analytical perspectives and tools to genocide studies, spatial forensics, and, in general, human rights topics and events. Within this toolkit, CL/NLP and QSR/ST highlight spatial relationships that are not necessarily mappable in a traditional GIS setting and allow researchers to detect patterns across large corpora of heterogeneous information. The mixing of methodologies and the combination of qualitative and quantitative data in Humanitarian GIS may change our understanding of an ongoing humanitarian crisis and aid in improving and increasing response to such crises.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
3.30
自引率
0.00%
发文量
0
期刊最新文献
Broker bureaucracies: The subsidiary offices of the digitalizing state (Retail) platform legitimation through municipal partnerships? Characterizing climate change sentiments in Alaska on social media Counter-mapping platformization. Rethinking the spatial differentiation of platform control, labor relations and restaurant virtualization in proprietary markets for food-delivery Synthetic geospatial data and fake geography: A case study on the implications of AI-derived data in a data-intensive society
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1