Travels with BERT: Surfacing the intertextuality in Hans Christian Andersen's travel writing and fairy tales through the network lens of large language model‐based topic modeling
{"title":"Travels with BERT: Surfacing the intertextuality in Hans Christian Andersen's travel writing and fairy tales through the network lens of large language model‐based topic modeling","authors":"Timothy R. Tangherlini, Ruofei Chen","doi":"10.1111/oli.12458","DOIUrl":null,"url":null,"abstract":"Hans Christian Andersen's fairy tales have garnered the greatest popular and scholarly attention despite the interdependence of works across the broad range of his artistic production. We read Andersen's fairy tales in concert with his travel writing to highlight the intertextual aspects that cross these seemingly distinct genres. We leverage recent advances in large language models (LLM) and network theory to generate representations that facilitate user exploration of these intertextual interdependencies across genres and across time. In the first part of our study, we use BERTopic and an LLM model fine‐tuned for nineteenth‐century Danish literary language to present independent and combined topic models of the two corpuses. This approach supports multi‐scalar analysis of intertextual elements within and across these corpuses, thereby implementing a method for macroscopic reading. In the second part of the study, we develop a series of networked representations of the dependencies between fairy tales, where these dependencies are generated on the basis of the shared intertextual topic space of the fairy tales and the travel writing.","PeriodicalId":42582,"journal":{"name":"ORBIS LITTERARUM","volume":null,"pages":null},"PeriodicalIF":0.2000,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ORBIS LITTERARUM","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1111/oli.12458","RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LITERATURE","Score":null,"Total":0}
引用次数: 0
Abstract
Hans Christian Andersen's fairy tales have garnered the greatest popular and scholarly attention despite the interdependence of works across the broad range of his artistic production. We read Andersen's fairy tales in concert with his travel writing to highlight the intertextual aspects that cross these seemingly distinct genres. We leverage recent advances in large language models (LLM) and network theory to generate representations that facilitate user exploration of these intertextual interdependencies across genres and across time. In the first part of our study, we use BERTopic and an LLM model fine‐tuned for nineteenth‐century Danish literary language to present independent and combined topic models of the two corpuses. This approach supports multi‐scalar analysis of intertextual elements within and across these corpuses, thereby implementing a method for macroscopic reading. In the second part of the study, we develop a series of networked representations of the dependencies between fairy tales, where these dependencies are generated on the basis of the shared intertextual topic space of the fairy tales and the travel writing.
期刊介绍:
Orbis Litterarum is an international journal devoted to the study of European, American and related literature. Orbis Litterarum publishes peer reviewed, original articles on matters of general and comparative literature, genre and period, as well as analyses of specific works bearing on issues of literary theory and literary history.