Journal of Cultural Analytics: Latest Articles

On Organizing a Shared Task for the Digital Humanities – Conclusions and Future Paths
Q1 Arts and Humanities Pub Date : 2021-12-13 DOI: 10.22148/001c.30697
Evelyn Gius, M. Willand, Nils Reiter
Shared tasks are a work format prevalent in the natural language processing and machine learning community. This special issue continues the reporting on the shared task SANTA (Systematic Analysis of Narrative levels Through Annotation), which has the development of annotation guidelines for narrative levels as its goal. Narrative levels, also known as embedded narrations, are omnipresent in many kinds of narrations, and one of the core concepts of narratology. In this introduction, we summarize the current state, report on the second annotation round in SANTA, draw some conclusions and, finally, derive some recommendations for future shared tasks in the digital humanities.
Citations: 1
On the Theory of Narrative Levels and Their Annotation in the Digital Context
Q1 Arts and Humanities Pub Date : 2021-12-13 DOI: 10.22148/001c.30700
Nora Ketschik, Benjamin Krautter, Sandra Murr, Yvonne Zimmermann
The article was written in the context of a Shared Task on the Analysis of Narrative levels Through Annotation (“SANTA”) which was published as a first draft in 2019. This revised version is based on further discussion on the formalization of the narratological concept of ‘narrative level.’ We firstly discuss the theory of narrative levels in literary studies, secondly derive features for the identification of narrative levels and finally develop guidelines for their annotation. An essential finding of the theoretical work lies in connecting the concept of ‘narrative level’ to the narrator. By identifying different types of narrators, we are able to enumerate and categorize different scenarios for the emergence of new levels in narrative texts. Hereby, the article does not remain restricted to prototypical cases, but also deals with rare and problematic cases. Overall, our goal is to provide a theoretical reflection on narrative levels and to create accurate guidelines for their recognition. The method of approaching the phenomenon through annotation has proven to be extremely fruitful, particularly in identifying the boundaries of the narrative levels.
Citations: 0
Narrative Boundaries Annotation Guide
Q1 Arts and Humanities Pub Date : 2021-12-13 DOI: 10.22148/001c.30698
Joshua D. Eisenberg, Mark A. Finlayson
It is rare for a story to have only a single narrative level: in fact, even the simplest stories usually contain multiple nested stories. The following is an annotation guide for encoding the boundaries between narrative levels, which has been validated on modern fiction and TV scripts. We provide definitions of, and give instructions for annotating information about, each narrative level: embedded narratives, interruptive narratives, flashbacks (analepsis), and flashforwards (prolepsis). This annotation schema can be used for many types of narratological and computational research; however, our intention in developing it was to lay the foundation for training computers to automatically extract narrative levels from long text.
Citations: 0
Annotation Guideline No. 7 (revised): Guidelines for annotation of narrative structure
Q1 Arts and Humanities Pub Date : 2021-12-13 DOI: 10.22148/001c.30703
Mats Wirén, Adam Ek
Analysis of narrative structure can be said to answer the question “Who tells what, and how?”. The key part of our annotation scheme is related to the “who?”, and to this end we distinguish between narration and fictional dialogue. Furthermore, with respect to the latter we keep track of turns, lines, identities of speakers and addressees, and speech-framing constructions, which provide the narrator’s cues about the circumstances of the speech. We also annotate voice, that is, whether the narrator is ever present in the story or not. Our annotation of the “what?” includes embeddings of narrative transmission levels to capture stories in stories, and embeddings of fictional dialogue to capture characters quoting other characters. Our annotation of the “how?” includes focalization, that is, the perspective from which the narrative is seen and how much information the narrator has access to.
Citations: 0
Annotation Guidelines for Narrative Levels
Q1 Arts and Humanities Pub Date : 2021-12-13 DOI: 10.22148/001c.30704
Adam Hammond
These guidelines present a minimalist set of instructions for annotating narrative levels. They strive for clarity and brevity, with a focus on clear examples and helpful rules of thumb. They present a one-sentence definition of a narrative and introduce the “Let me tell you a story” rule of thumb for determining whether a narrative level boundary has been crossed. Only four attributes are introduced: narrative number, narrative level, narrator ID, and whether a given narrative is left “open” or is closed.
Citations: 0
The Generative Dissensus of Reading the Feminist Novel, 1995-2020: A Computational Analysis of Interpretive Communities
Q1 Arts and Humanities Pub Date : 2021-11-19 DOI: 10.22148/001c.30009
Lisa Mendelman, Anna Mukamal
This article furthers ongoing work on the merits of the feminist novel’s intrinsic variability by probing its dynamics in four publishing contexts: contemporary anglophone literary criticism, prestigious review publications, marketing materials, and online book reviews by social readers. We explore how these interpretive communities converge and diverge in their assessments of feminist fiction over the past twenty-five years by evaluating articles from the MLA International Bibliography, book reviews in The New York Times, The New Yorker, Times Literary Supplement, and other prominent periodicals, blurbs from Amazon, and Goodreads reviews. We trace the feminist novel’s ambivalent fates (or rather, feminist novels’ ambivalent fates) in and across these four domains. To do so, we engage computational methods of topic modeling, most distinctive word analysis, and named entity recognition. We synthesize these quantitative results with qualitative attention to provocative examples from our corpus. In so doing, we consider how literary scholars can develop more robust understandings of what feminism and feminist fiction mean to contemporary readers and what we stand to gain by bringing this diverse interpretive labor into our scholarly conversations. Our synthetic interpretive approach reveals these communities’ shared topical investments in feminist fiction, though the communities talk about these topics in importantly different ways. Together, their discourse converges on two organizing concerns: embodied subjectivity and temporality. Different configurations of these aspects of personhood in time inform the communities’ vocabularies, their modes of self-address, the rationales they offer for reading feminist novels, and the forms of feminist subjectivity they promote. Our analysis thus demonstrates how novel reading can function as a mode of forging feminist knowledge and constructing feminist value.
Citations: 1
Audiobook Stylistics: Comparing print and audio in the bestselling segment
Q1 Arts and Humanities Pub Date : 2021-11-02 DOI: 10.22148/001c.29802
Karl Berglund, Mats Dahllöf
The paper explores differences between bestsellers in print and the most popular audiobooks in a subscription-based streaming service for books (“beststreamers”) by means of computational stylistics. The point of departure is the complete set of print bestsellers and digital audiobook beststreamers for the Swedish book market 2015–2019, in total 172 novels. We probed 34 linguistic measures to track differences between subsets at the stylistic level. The results indicate that there are pronounced differences between the formats. Print bestsellers are longer, syntactically more complex and varied, and seem to focus more on depiction. Beststreaming audiobooks, by contrast, are shorter, more straightforwardly written, and appear to highlight plot and dialogue. The results are replicated when the comparison is restricted to crime fiction, the most prominent genre in the commercial top segment. Given these results, it is argued that it is possible to discern a particular audiobook style as one factor affecting book consumption in digital formats, and conversely that the printed format is associated with other stylistic preferences.
Citations: 5
Celebrating 5 Years of Cultural Analytics
Q1 Arts and Humanities Pub Date : 2021-09-14 DOI: 10.22148/001c.28215
A. Piper
Announcing our five-year anniversary, we look forward to a robust future of more cultural analytics.
Citations: 0
The Measure of the Archive: The Robustness of Network Analysis in Early Modern Correspondence
Q1 Arts and Humanities Pub Date : 2021-07-21 DOI: 10.22148/001C.25943
Y. Ryan, S. Ahnert
Network analysis of historical correspondence can be a fruitful way to address historical research questions, and has been increasingly used in historical studies over the past decade. As with many areas of quantitative humanities research, the reliability of the results is often called into question, given that such approaches require ‘hard data’ as input, yet almost inevitably use datasets with partial or missing records. Other disciplines using network analysis have conducted robustness experiments designed to test the impact of data loss or error on their results. In order to test how this missing data might affect our own area of research, we conducted a number of experiments designed to simulate the impact of the kinds of loss often seen in historical correspondence data, including random document loss, missing years, and errors in the disambiguation and de-duplication process. The results show that most network centrality measures maintain robustness until a very large proportion of the data (60% or more) is removed. Some measures showed a linear change in robustness, while others remained high and then fell off sharply. Only one, transitivity (local clustering coefficient), was significantly impacted throughout. We tested a range of data loss scenarios (random single letters, folio books of manuscript letters, catalogues, and entire years) and a range of commonly used network metrics. In addition, we tested the robustness of more complex network analysis results in the literature that combine several network metrics to highlight individuals in the network, and found that the same types of individuals would likely have been highlighted even with 50% random letter loss. Alongside the article is a web application, built using Shiny, which will calculate robustness measures for a user-uploaded network dataset. We conclude that researchers working with similar historical correspondence datasets might be able to consider network analysis results to be robust in most cases, rather than work on the assumption that missing data would lead to very different findings or results.
Citations: 3
Chance Encounters: World Literature Between the Unexpected and the Probable
Q1 Arts and Humanities Pub Date : 2021-07-09 DOI: 10.22148/001C.25525
Hoyt Long
This essay brings probabilistic reasoning into concerted dialogue with book-historical and sociological approaches to world literature. Using extensive bibliographic data about literary translations into Japanese during the modern era, it develops a series of case studies at interrelated scales—the literary anthology, world library collections, and individual readers—to reason about the likelihood of certain authors or works being plucked from the swirling currents of the global traffic in books. At each scale, I consider how such data might inform the interpretations we give to the choice of one author over another in a given context. Woven into these case studies is an extended reflection on the history of probabilistic reasoning from the late-eighteenth century to the late-twentieth. What, this essay ultimately asks, might literary historians gain from taking this history seriously in our own appeals to chance as a form of historical explanation?
Citations: 2