Deep learning-based lexical character identification in TV series
Paola Dalla Torre, Paolo Fantozzi, Maurizio Naldi
Digital Scholarship in the Humanities, published 2023-10-06
DOI: 10.1093/llc/fqad068 (https://doi.org/10.1093/llc/fqad068)
Citations: 0
Abstract
Automated character identification in movies and TV series has typically been carried out by detecting faces in video and associating them with characters' names extracted from dialogues or cast lists. We propose a deep learning architecture that identifies characters from subtitles only, specifically through the lexicon those characters employ. The identification task is formalized as a multi-class classification task. We apply our technique to the complete set of episodes of the Gomorrah TV series and achieve an average identification accuracy above 94 per cent on the full set of characters.
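To make the multi-class formulation concrete, the sketch below treats each subtitle line as an input and each character as a class. It is not the authors' deep learning architecture: a multinomial Naive Bayes classifier over word counts is swapped in as a much simpler stand-in, and the speaker labels and subtitle lines are invented for illustration only.

```python
import math
from collections import Counter, defaultdict

PUNCT = ".,!?;:\"'"

def tokenize(line):
    # Lowercase, split on whitespace, strip surrounding punctuation.
    tokens = (w.strip(PUNCT).lower() for w in line.split())
    return [t for t in tokens if t]

def train(samples):
    # samples: list of (speaker_label, subtitle_line) pairs.
    word_counts = defaultdict(Counter)  # per-speaker word frequencies
    line_counts = Counter()             # number of lines per speaker
    vocab = set()
    for speaker, line in samples:
        tokens = tokenize(line)
        word_counts[speaker].update(tokens)
        line_counts[speaker] += 1
        vocab.update(tokens)
    return word_counts, line_counts, vocab

def predict(model, line):
    # Return the speaker with the highest log-posterior for the line.
    word_counts, line_counts, vocab = model
    total_lines = sum(line_counts.values())
    best, best_score = None, -math.inf
    for speaker in line_counts:
        score = math.log(line_counts[speaker] / total_lines)  # class prior
        total_words = sum(word_counts[speaker].values())
        for tok in tokenize(line):
            # Laplace (add-one) smoothing handles unseen words.
            count = word_counts[speaker][tok]
            score += math.log((count + 1) / (total_words + len(vocab)))
        if score > best_score:
            best, best_score = speaker, score
    return best

# Hypothetical toy data, not taken from any real subtitles.
samples = [
    ("speaker_a", "We move the cargo tonight."),
    ("speaker_a", "The cargo is at the port."),
    ("speaker_b", "Family comes first, always."),
    ("speaker_b", "Respect the family."),
]
model = train(samples)
print(predict(model, "Where is the cargo?"))
print(predict(model, "The family first."))
```

A neural model would replace the word-count likelihood with learned representations, but the interface stays the same: one subtitle line in, one character label out.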
About the journal:
Digital Scholarship in the Humanities (DSH) is an international, peer-reviewed journal that publishes original contributions on all aspects of digital scholarship in the Humanities, including, but not limited to, the field currently called the Digital Humanities. Long and short papers report on theoretical, methodological, experimental, and applied research and include results of research projects, descriptions and evaluations of tools, techniques, and methodologies, and reports on work in progress. DSH also publishes reviews of books and resources. Digital Scholarship in the Humanities was previously known as Literary and Linguistic Computing.