Journal of Cultural Analytics: Latest Articles

Theory-Driven Statistics for the Digital Humanities: Presenting Pitfalls and a Practical Guide by the Example of the Reformation
Q1 Arts and Humanities Pub Date: 2022-12-28 DOI: 10.22148/001c.57764
Ramona Roller
The Digital Humanities face the problem of multiple hypothesis testing: Ever more hypotheses are tested until a desired pattern has been found. This practice is prone to mistaking random patterns for real ones. Instead, we should reduce the number of hypothesis tests and test only meaningful hypotheses. We address this problem by using theory to generate hypotheses for statistical models. We illustrate our approach with the example of the European Reformation, where we test a theory on the role of opinion leaders in the adoption of Protestantism with a logistic regression model. Given our specific setting, including choice of data and operationalisation of variables, we do not find enough evidence to claim that opinion leaders contributed via personal visits and letters to the adoption of Protestantism. To falsify or to support a theory, it has to be tested in different settings. Our presented approach helps the Digital Humanities bridge the gap between the qualitative and quantitative camps, advance understanding of structures resulting from human activity, and increase scientific credibility.
Citations: 0
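As a rough illustration of the multiple-testing problem described in the abstract above (this is not code from the article): if m independent tests are each run at significance level alpha and every null hypothesis is true, the chance of at least one spurious "finding" is 1 - (1 - alpha)^m. The function names, the independence assumption, and the Bonferroni adjustment shown alongside are illustrative choices, not something the abstract specifies.

```python
def familywise_error_rate(num_tests: int, alpha: float = 0.05) -> float:
    """Probability of at least one false positive among `num_tests`
    independent tests when every null hypothesis is actually true."""
    return 1.0 - (1.0 - alpha) ** num_tests


def bonferroni_alpha(num_tests: int, alpha: float = 0.05) -> float:
    """Per-test threshold that keeps the family-wise error rate at or
    below `alpha` (the conservative Bonferroni adjustment)."""
    return alpha / num_tests


if __name__ == "__main__":
    # The risk of a chance "pattern" grows quickly with the number of tests.
    for m in (1, 5, 20, 100):
        print(m, round(familywise_error_rate(m), 3), bonferroni_alpha(m))
```

Under these assumptions, testing fewer, theory-motivated hypotheses keeps m small, which is the point the abstract makes in prose.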
Conceptual Forays: A Corpus-based Study of “Theory” in Digital Humanities Journals
Q1 Arts and Humanities Pub Date: 2022-12-19 DOI: 10.22148/001c.55507
Rabea Kleymann, A. Niekler, M. Burghardt
The status of theory in the Digital Humanities (DH) has been the subject of much debate. As a result, we find different theory narratives competing and entangled with each other. If at all, these narratives can only be grasped and examined from a somewhat detached perspective. Here, we attempt to investigate these elusive narratives by means of a conceptual history approach. In doing so, we define different theory dimensions, ranging from specific cultural and literary theory frameworks to more generic uses of the concept of theory. We examine the use and semantic changes of these theory notions in a large corpus of DH journals. Using a mixture of heuristic methods and approaches from the field of distributional semantics, we aim to create tellable conceptual stories of DH theory.
Citations: 0
From Concepts to Texts and Back: Operationalization as a Core Activity of Digital Humanities
Q1 Arts and Humanities Pub Date: 2022-12-08 DOI: 10.22148/001c.57195
Axel Pichler, Nils Reiter
This article puts operationalization as a research practice and its theoretical consequences into focus. As all sciences and humanities fields use concepts to describe their realm of investigation, digital humanities projects are usually faced with the challenge of ‘bridging the gap’ from theoretical concepts (whose meaning(s) depend on a certain theory and which are used to describe expectations, hypotheses, and results) to results derived from data. The process of developing methods to bridge this gap is called ‘operationalization’, and it is a common task for any kind of quantitative, formal, or digital analysis. Furthermore, operationalization choices have long-lasting consequences, as they (obviously) influence the results that can be achieved, and, in turn, the possibilities to interpret these results in terms of the original research question. However, even though this process is so important and so common, its theoretical consequences are rarely reflected upon. Because the concepts that are operationalized cannot be operationalized in isolation, operationalizing is not only an engineering or implementation challenge, but touches on the theoretical core of the research questions we work on, and the fields we work in. In this article, we first clarify the need to operationalize using selected, representative examples, situate the process within typical DH workflows, and highlight the consequences that operationalization decisions have. We will then argue that operationalization plays such a crucial role for the digital humanities that any kind of theory needs to take off from operationalization practices. Based on these assumptions, we will develop a first scheme of the constraints and necessities of such a theory and reflect on their epistemic consequences.
Citations: 0
Grounding Theory in Digital Data: A Methodological Approach for a Reflective Procedural Framework
Q1 Arts and Humanities Pub Date: 2022-12-08 DOI: 10.22148/001c.57197
A. Bischof, Konstantin Freybe
Instead of looking for new paradigms for Digital Humanities (DH), we present Grounded Theory Methodology (GTM) as a methodological approach to frame digital research practices more reflectively. By turning to the epistemological and practical implications of digital tools like Topic Modeling and digital data sources like YouTube comments, we highlight the theoretical assumptions that are already in the game—and call for more explicitness and methodical monitoring. To explain the procedures of GTM and the proposed worth for DH, we present an example of a qualitative research project using machine learning techniques to narrow down a large scale of data to human interpretable resample. The methodically monitored resampling process provided valuable means to validly minimize the amount of data without losing a qualitative trajectory of the process itself. Defining and tracing relevant content in our original data set enabled us to find related comments and textual conversations to be analyzed further. We discuss the example iteration in two ways: Our prototype and procedure show on the one hand, how qualitative research and computational methods can be better intertwined without compromising their epistemological foundations. On the other hand, we argue for an understanding of DH as research practice, that should follow an abductive research agenda in order to ground its theories in data.
Citations: 0
Are Computational Literary Studies Structuralist?
Q1 Arts and Humanities Pub Date: 2022-12-01 DOI: 10.22148/001c.46662
Evelyn Gius, Janina Jacke
In this contribution we discuss what we call the “digital humanities-as-structuralism” narrative for the case of computational literary studies. To better understand the entailed criticism, we start with some background for the non-computational aspects in this narrative. First, we single out major criticisms against structuralism. We then introduce a general and theory-independent model of literary text analysis and discuss hypothesis development and justification in literary studies. This builds the ground for our analysis of structuralism criticisms in computational literary studies. In our discussion of the “digital humanities-as-structuralism” narrative, we examine the use of computational methods for the exploration and confirmation of interpretation hypotheses and its potential relation to structuralist issues. We argue that the “digital humanities-as-structuralism” narrative may be productive where it cautions against reductionist approaches, but it is not appropriate for describing exploratory or partial approaches and the presentation of their findings. There, the computational approaches should rather be seen as enabling connectivity and fostering the joint endeavor of understanding.
Citations: 1
Sailing on Encrypted Seas: The Archive and Digital Memory in African and Diasporic Futurism
Q1 Arts and Humanities Pub Date: 2022-11-23 DOI: 10.22148/001c.55508
Amanda Furiasse
Digitization has commonly been marketed as a predictive technology that can enable humanity to intercede into the future. This faith in digital media’s prophetic powers, however, obfuscates the fact that digitization is unavoidably stuck in the past. In effect, digitization transforms the past into highly mutable and volatile data sets that are persistently rewritten by computers’ memory refresh circuits. While some lament this temporal incongruity as problematic to the archival process, African and diasporic futurist artists are utilizing digital distortion as an opportunity to emplace the archival process within the sea and reimagine the archive as an impermanent, transitory, and fluid practice that has the capacity to usher in a more culturally and scientifically nuanced understanding of memory. This article explores the capacity of the sea to reorientate digital humanities scholarship around the cyclical interplay between machinic, environmental, and human social systems and to craft historiographical methods around envisioning a viable future for humanity.
Citations: 0
Foreword to the Special Issue “Theorytellings: Epistemic Narratives in the Digital Humanities”
Q1 Arts and Humanities Pub Date: 2022-11-23 DOI: 10.22148/001c.55593
Rabea Kleymann, M. Burghardt, Jonathan D. Geiger, Mareike Schumacher
This special issue deals with existing theory narratives and conceptions in DH scholarship. Introducing the neologism “theorytellings”, this special issue invites DH scholars to narrate and discuss their own theoretical contributions to the field.
Citations: 1
Biodiversity is not declining in fiction
Q1 Arts and Humanities Pub Date: 2022-10-06 DOI: 10.22148/001c.38739
Andrew Piper
This paper attempts to replicate the findings of the recent work, “The rise and fall of biodiversity in literature,” by Langer et al. (2021). Using a large corpus from Project Gutenberg (N = ~15,000) and a dictionary-matching method of over 240K biological taxa, Langer et al. find that the frequency and diversity of biological taxa have been declining steadily since the first half of the nineteenth century, echoing prior work in cultural analytics. This paper applies the original paper’s three primary measures to two additional data sets along with the original dataset and compares their dictionary-based method with an alternative supervised machine learning method. I find that the trajectory of biological tokens in fiction in the new data sets is directionally opposite to that shown by Langer et al., independent of the methods used (i.e. taxa rise rather than fall since the first half of the nineteenth century), but that their breakpoint estimation appears largely robust within +/- 15 years. Based on this analysis, I suggest that the discrepancy between our results is due to corpus construction rather than choice of method. I find that only conditioning on fiction in the original dataset generates results more similar to the two alternative datasets used here. In addition to emphasizing the importance of corpus construction for cultural analytics, these findings also raise larger questions about the difficulties of interpreting lexical items as indices of social attitudes, pointing to a need for future work.
Citations: 0
The Evolution of the Idiolect over the Lifetime: A Quantitative and Qualitative Study of French 19th Century Literature
Q1 Arts and Humanities Pub Date: 2022-09-01 DOI: 10.22148/001c.37588
Olga Seminck, P. Gambette, Dominique Legallois, T. Poibeau
The way in which authors express themselves is unique but changes over their lifetime. However, quantitative studies of this idiolectal evolution are rare. Using the Corpus for Idiolectal Research (CIDRE) that contains the dated works of 11 prolific 19th century French fiction writers, we propose new methods to identify, quantify and describe the grammatical-stylistic changes that take place using lexico-morphosyntactic patterns, also called motifs. To examine the strength of the chronological signal of change, we developed a method to calculate if a distance matrix of literary works contains a stronger chronological signal than expected by chance. Ten out of 11 corpora showed a higher than chance chronological signal, leading us to conclude that the evolution of the idiolect is in a mathematical sense monotonic, supporting the rectilinearity hypothesis previously put forward in the stylometric literature. The rectilinear property of the evolution of the idiolect found for most authors in CIDRE subsequently enabled us to propose a machine learning task: predicting the year in which a work was written. For the majority of the authors in our corpus, the accuracy and the amount of variance that is explained by the model were high and we discuss why the technique might fail for others. After applying a feature selection algorithm, we examined the most important features, i.e. the motifs that have the greatest influence on idiolectal evolution. We find that some of those features are stylistic and have been previously identified in qualitative literature studies. We report some remarkable stylistic constructions revealed by our algorithm to illustrate which kind of stylistic patterns can be extracted using our method.
Citations: 4
Literary value in the era of big data. Operationalizing critical distance in professional and non-professional reviews
Q1 Arts and Humanities Pub Date : 2022-06-16 DOI: 10.22148/001c.36446
M. Salgaro
New phenomena such as digital social reading, instapoets, and the “rating culture” expressed in online reviews challenge traditional literary criticism in newspapers and journals. Millions of reviews on platforms such as Amazon or Goodreads are part of this culture of participation and a counterweight to professional criticism. At the same time, successful instapoets such as Rupi Kaur reject the expertise of the gatekeepers of “prestigious literary circles” and try to establish a direct connection with readers. The aim of this paper is to build the proper methodological framework to capture these changes in the current literary system. To do this, the phenomenon of online reviewing has to be contextualized within the history and the praxis of assigning literary value to literary texts, the so-called canonization. In addition, literary theory needs to be able to analyze quantitative data and to integrate numbers into its models (engaging in a procedure that is called operationalization).
Citations: 1