Identifying social norm violation in movie plots: from Borat to American Pie
Yair Neuman, Yochai Cohen, Wenpeng Yin
Digital Scholarship in the Humanities (journal article), published 2023-10-05
DOI: 10.1093/llc/fqad052 (https://doi.org/10.1093/llc/fqad052)
Citation count: 0
Abstract
The violation of social norms in TV and cinema is a well-known source of humor and catharsis, and researchers in digital humanities may benefit from the automatic identification of social norm violations. In this article, we introduce a novel methodology for identifying and analyzing the violation of social norms in textual data and illustrate it in the analysis of movie plots. The methodology relies on zero-shot classification, which is particularly relevant when massive, labeled datasets are unavailable. We test our methodology and provide researchers with (1) a theoretically grounded tool for screening textual data for social norm violations, (2) a dataset of 6,806 embarrassing situations from movie plots, each with its hypothesized violated norm, and (3) a dataset of 3,059 movie plots with their average embarrassment scores.
About the journal:
Digital Scholarship in the Humanities (DSH) is an international, peer-reviewed journal that publishes original contributions on all aspects of digital scholarship in the humanities, including, but not limited to, the field currently called the Digital Humanities. Long and short papers report on theoretical, methodological, experimental, and applied research and include results of research projects, descriptions and evaluations of tools, techniques, and methodologies, and reports on work in progress. DSH also publishes reviews of books and resources. The journal was previously known as Literary and Linguistic Computing.