{"title":"文本挖掘和语义三元组:应用人道主义法医研究中的文本空间分析","authors":"Molly Miranker, Alberto Giordano","doi":"10.1016/j.diggeo.2020.100005","DOIUrl":null,"url":null,"abstract":"<div><p>The methods and tools of Geographic Information Sciences (GIScience)—spatial analysis, spatial statistics, and geographic information technologies—are increasingly being used in forensic humanitarian projects. In this article we explore ways to parse and analyze social and media releases from the United States Customs and Border Patrol (CBP) to gain an understanding of the death of migrants at the Texas-Mexico border. The methods we used include corpus linguistic (CL)/natural language processing (NLP) and Qualitative Spatial Representation (QSR) and Semantic Triples (ST). Our results indicate that CL/NLP and QSR/ST have the potential to increase and improve deceased migrant case identification by providing a framework for searching for key terms or themes throughout multiple textual sources. In the specific case examined, however, CL/NLP showed that CBP social media focused on drug confiscation and general patrolling activities and were of limited use for tabulating incidences of migrant death. On the other hand, QSR/ST visualizations showed which CBP Stations most frequently reported deceased migrant recoveries (i.e., search and collection of human remains) and with whom they collaborated.</p><p>We believe these methods are part of the methodological toolkit needed to lay the ground for what we call Humanitarian GIS—the application of spatial analytical perspectives and tools to genocide studies, spatial forensics, and, in general, human rights topics and events. Within this toolkit, CL/NLP and QSR/ST highlight spatial relationships that are not necessarily mappable in a traditional GIS setting and allow researchers to detect patterns across large corpora of heterogeneous information. The mixing of methodologies and the combination of qualitative and quantitative data in Humanitarian GIS may change our understanding of an ongoing humanitarian crisis and aid in improving and increasing response to such crises.</p></div>","PeriodicalId":100377,"journal":{"name":"Digital Geography and Society","volume":"1 ","pages":"Article 100005"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.diggeo.2020.100005","citationCount":"8","resultStr":"{\"title\":\"Text mining and semantic triples: Spatial analyses of text in applied humanitarian forensic research\",\"authors\":\"Molly Miranker, Alberto Giordano\",\"doi\":\"10.1016/j.diggeo.2020.100005\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The methods and tools of Geographic Information Sciences (GIScience)—spatial analysis, spatial statistics, and geographic information technologies—are increasingly being used in forensic humanitarian projects. In this article we explore ways to parse and analyze social and media releases from the United States Customs and Border Patrol (CBP) to gain an understanding of the death of migrants at the Texas-Mexico border. The methods we used include corpus linguistic (CL)/natural language processing (NLP) and Qualitative Spatial Representation (QSR) and Semantic Triples (ST). Our results indicate that CL/NLP and QSR/ST have the potential to increase and improve deceased migrant case identification by providing a framework for searching for key terms or themes throughout multiple textual sources. In the specific case examined, however, CL/NLP showed that CBP social media focused on drug confiscation and general patrolling activities and were of limited use for tabulating incidences of migrant death. On the other hand, QSR/ST visualizations showed which CBP Stations most frequently reported deceased migrant recoveries (i.e., search and collection of human remains) and with whom they collaborated.</p><p>We believe these methods are part of the methodological toolkit needed to lay the ground for what we call Humanitarian GIS—the application of spatial analytical perspectives and tools to genocide studies, spatial forensics, and, in general, human rights topics and events. Within this toolkit, CL/NLP and QSR/ST highlight spatial relationships that are not necessarily mappable in a traditional GIS setting and allow researchers to detect patterns across large corpora of heterogeneous information. The mixing of methodologies and the combination of qualitative and quantitative data in Humanitarian GIS may change our understanding of an ongoing humanitarian crisis and aid in improving and increasing response to such crises.</p></div>\",\"PeriodicalId\":100377,\"journal\":{\"name\":\"Digital Geography and Society\",\"volume\":\"1 \",\"pages\":\"Article 100005\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1016/j.diggeo.2020.100005\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Geography and Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666378320300052\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Geography and Society","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666378320300052","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Text mining and semantic triples: Spatial analyses of text in applied humanitarian forensic research
The methods and tools of Geographic Information Sciences (GIScience)—spatial analysis, spatial statistics, and geographic information technologies—are increasingly being used in forensic humanitarian projects. In this article we explore ways to parse and analyze social and media releases from the United States Customs and Border Patrol (CBP) to gain an understanding of the death of migrants at the Texas-Mexico border. The methods we used include corpus linguistic (CL)/natural language processing (NLP) and Qualitative Spatial Representation (QSR) and Semantic Triples (ST). Our results indicate that CL/NLP and QSR/ST have the potential to increase and improve deceased migrant case identification by providing a framework for searching for key terms or themes throughout multiple textual sources. In the specific case examined, however, CL/NLP showed that CBP social media focused on drug confiscation and general patrolling activities and were of limited use for tabulating incidences of migrant death. On the other hand, QSR/ST visualizations showed which CBP Stations most frequently reported deceased migrant recoveries (i.e., search and collection of human remains) and with whom they collaborated.
We believe these methods are part of the methodological toolkit needed to lay the ground for what we call Humanitarian GIS—the application of spatial analytical perspectives and tools to genocide studies, spatial forensics, and, in general, human rights topics and events. Within this toolkit, CL/NLP and QSR/ST highlight spatial relationships that are not necessarily mappable in a traditional GIS setting and allow researchers to detect patterns across large corpora of heterogeneous information. The mixing of methodologies and the combination of qualitative and quantitative data in Humanitarian GIS may change our understanding of an ongoing humanitarian crisis and aid in improving and increasing response to such crises.