Digital Scholarship in the Humanities最新文献

Social network analysis of the Babylonian Talmud 巴比伦塔木德经》的社会网络分析

IF 0.8 3区文学 0 HUMANITIES, MULTIDISCIPLINARY

Digital Scholarship in the Humanities

Pub Date : 2024-07-18 DOI: 10.1093/llc/fqae037

Michael L Satlow, Michael Sperling

This article analyzes the citation network of the Babylonian Talmud, building on an earlier article that we published (Satlow and Sperling 2022). The article has three goals. Our first goal is to show how an ontological-based information extraction system combined with pattern matching can successfully extract structured data from a very complicated, unstructured text. Our second goal is to extend our previous analysis and demonstrate how citation data might lead to wider conclusions about redactional patterns. In addition to highlighting the citation tendencies of different tractates (which could indicate different redactors for those tractates), we hypothesize that there existed a source document originating in the circle of Rav Yehudah bar Yehezkel, used by at least some redactors, and that the character of Rabbi Zeira deserves further attention as an important figure connecting different nodes on the network. Finally, we seek to outline an analytical workflow that could be helpful to other historical projects in the digital humanities.

本文分析了《巴比伦塔木德经》的引文网络，以我们之前发表的一篇文章（Satlow and Sperling 2022）为基础。本文有三个目标。第一个目标是展示基于本体的信息提取系统如何结合模式匹配，成功地从非常复杂的非结构化文本中提取结构化数据。我们的第二个目标是扩展我们之前的分析，并展示引文数据如何能为编辑模式带来更广泛的结论。除了强调不同篇章的引用倾向（这可能表明这些篇章有不同的节录者）之外，我们还假设存在一个源文件，该源文件源自 Rav Yehudah bar Yehezkel 的圈子，至少被一些节录者使用，而拉比-泽拉（Rabbi Zeira）作为连接网络上不同节点的重要人物值得进一步关注。最后，我们试图勾勒出一种分析工作流程，它可能对数字人文领域的其他历史项目有所帮助。

引用次数: 0

Ancient classical theatre from the digital humanities: a systematic review 2010–21 数字人文学科中的古代古典戏剧：2010-21 年系统回顾

IF 0.8 3区文学 0 HUMANITIES, MULTIDISCIPLINARY

Digital Scholarship in the Humanities

Pub Date : 2024-06-29 DOI: 10.1093/llc/fqae033

Roxana Beatriz Martínez Nieto, Monika Dabrowska

The aim of this article is to offer a systematic review of digital studies that provide new research perspectives on ancient classical theatre. The undeniable progress in the field of computational analysis in the service of traditional textual interpretation is helping to study in greater depth and to interpret in greater detail the classical linguistic corpora that have come down to us through the manuscript tradition. The new model of digital research is integrated not only in the field of information technologies, but also in the field of e-learning, where we can already observe the implementation of a new educational model. Based on the digital processing of data on Greco-Roman theatre, a systematic review is presented, following the methodological principles of the PRISMA statement [Preferred Reporting Items for Systematic Reviews and Meta-Analyses 2009 (2020)].

本文旨在对为古代古典戏剧提供新研究视角的数字研究进行系统回顾。不可否认，计算分析领域在传统文本解读方面取得的进展有助于更深入地研究和更详细地解读通过手稿传统流传下来的古典语言语料库。数字化研究的新模式不仅融入了信息技术领域，而且也融入了电子学习领域，在这一领域，我们已经可以看到一种新的教育模式的实施。在对希腊-罗马戏剧数据进行数字化处理的基础上，按照 PRISMA 声明[2009 年（2020 年）系统综述和元分析首选报告项目]的方法论原则，提出了一项系统综述。

引用次数: 0

Personality prediction via multi-task transformer architecture combined with image aesthetics 通过多任务转换器架构结合图像美学进行人格预测

IF 0.8 3区文学 0 HUMANITIES, MULTIDISCIPLINARY

Digital Scholarship in the Humanities

Pub Date : 2024-06-22 DOI: 10.1093/llc/fqae034

Shahryar Salmani Bajestani, Mohammad Mahdi Khalilzadeh, Mahdi Azarnoosh, Hamid Reza Kobravi

Social media has found its path into the daily lives of people. There are several ways that users communicate in which liking and sharing images stands out. Each image shared by a user can be analyzed from aesthetic and personality traits views. In recent studies, it has been proved that personality traits impact personalized image aesthetics assessment. In this article, the same pattern was studied from a different perspective. So, we evaluated the impact of image aesthetics on personality traits to check if there is any relation between them in this form. Hence, in a two-stage architecture, we have leveraged image aesthetics to predict the personality traits of users. The first stage includes a multi-task deep learning paradigm that consists of an encoder/decoder in which the core of the network is a Swin Transformer. The second stage combines image aesthetics and personality traits with an attention mechanism for personality trait prediction. The results showed that the proposed method had achieved an average Spearman Rank Order Correlation Coefficient (SROCC) of 0.776 in image aesthetic on the Flickr-AES database and an average SROCC of 0.6730 on the PsychoFlickr database, which outperformed related SOTA (State of the Art) studies. The average accuracy performance of the first stage was boosted by 7.02 per cent in the second stage, considering the influence of image aesthetics on personality trait prediction.

社交媒体已进入人们的日常生活。用户有多种交流方式，其中喜欢和分享图片的方式最为突出。用户分享的每张图片都可以从审美和个性特征的角度进行分析。最近的研究证明，个性特征会影响个性化图片美学评估。在本文中，我们从不同的角度研究了相同的模式。因此，我们评估了形象美学对人格特质的影响，以检查在这种形式下它们之间是否存在任何关系。因此，在一个两阶段的架构中，我们利用图像美学来预测用户的个性特征。第一阶段包括多任务深度学习范式，由编码器/解码器组成，其中网络的核心是 Swin Transformer。第二阶段将图像美学和个性特征结合起来，利用注意力机制进行个性特征预测。结果表明，所提出的方法在 Flickr-AES 数据库的图像美学方面取得了平均 0.776 的斯皮尔曼秩相关系数（SROCC），在 PsychoFlickr 数据库的图像美学方面取得了平均 0.6730 的斯皮尔曼秩相关系数（SROCC），优于相关的 SOTA（艺术水平）研究。考虑到图像美学对人格特质预测的影响，第一阶段的平均准确率在第二阶段提高了 7.02%。

{"title":"Personality prediction via multi-task transformer architecture combined with image aesthetics","authors":"Shahryar Salmani Bajestani, Mohammad Mahdi Khalilzadeh, Mahdi Azarnoosh, Hamid Reza Kobravi","doi":"10.1093/llc/fqae034","DOIUrl":"https://doi.org/10.1093/llc/fqae034","url":null,"abstract":"Social media has found its path into the daily lives of people. There are several ways that users communicate in which liking and sharing images stands out. Each image shared by a user can be analyzed from aesthetic and personality traits views. In recent studies, it has been proved that personality traits impact personalized image aesthetics assessment. In this article, the same pattern was studied from a different perspective. So, we evaluated the impact of image aesthetics on personality traits to check if there is any relation between them in this form. Hence, in a two-stage architecture, we have leveraged image aesthetics to predict the personality traits of users. The first stage includes a multi-task deep learning paradigm that consists of an encoder/decoder in which the core of the network is a Swin Transformer. The second stage combines image aesthetics and personality traits with an attention mechanism for personality trait prediction. The results showed that the proposed method had achieved an average Spearman Rank Order Correlation Coefficient (SROCC) of 0.776 in image aesthetic on the Flickr-AES database and an average SROCC of 0.6730 on the PsychoFlickr database, which outperformed related SOTA (State of the Art) studies. The average accuracy performance of the first stage was boosted by 7.02 per cent in the second stage, considering the influence of image aesthetics on personality trait prediction.","PeriodicalId":45315,"journal":{"name":"Digital Scholarship in the Humanities","volume":"29 1","pages":""},"PeriodicalIF":0.8,"publicationDate":"2024-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141529784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Language-based machine perception: linguistic perspectives on the compilation of captioning datasets 基于语言的机器感知：从语言学角度看字幕数据集的编制工作

IF 0.8 3区文学 0 HUMANITIES, MULTIDISCIPLINARY

Digital Scholarship in the Humanities

Pub Date : 2024-06-22 DOI: 10.1093/llc/fqae029

Laura Hekanaho, Maija Hirvonen, Tuomas Virtanen

Over the last decade, a plethora of training datasets have been compiled for use in language-based machine perception and in human-centered AI, alongside research regarding their compilation methods. From a primarily linguistic perspective, we add to these studies in two ways. First, we provide an overview of sixty-six training datasets used in automatic image, video, and audio captioning, examining their compilation methods with a metadata analysis. Second, we delve into the annotation process of crowdsourced datasets with an interest in understanding the linguistic factors that affect the form and content of the captions, such as contextualization and perspectivation. With a qualitative content analysis, we examine annotator instructions with a selection of eleven datasets. Drawing from various theoretical frameworks that help assess the effectiveness of the instructions, we discuss the visual and textual presentation of the instructions, as well as the perspective-guidance that is an essential part of the language instructions. While our analysis indicates that some standards in the formulation of instructions seem to have formed in the field, we also identified various reoccurring issues potentially hindering readability and comprehensibility of the instructions, and therefore, caption quality. To enhance readability, we emphasize the importance of text structure, organization of the information, consistent use of typographical cues, and clarity of language use. Last, engaging with previous research, we assess the compilation of both web-sourced and crowdsourced captioning datasets from various perspectives, discussing factors affecting the diversity of the datasets.

在过去的十年中，已经有大量的训练数据集被编译用于基于语言的机器感知和以人为中心的人工智能，同时还有关于其编译方法的研究。我们主要从语言学的角度，从两个方面对这些研究进行补充。首先，我们概述了用于自动图像、视频和音频字幕的 66 个训练数据集，并通过元数据分析研究了它们的编译方法。其次，我们深入研究了众包数据集的注释过程，希望了解影响字幕形式和内容的语言因素，如语境化和视角化。通过定性内容分析，我们选取了 11 个数据集来研究注释者的说明。我们借鉴了有助于评估说明有效性的各种理论框架，讨论了说明的视觉和文本呈现方式，以及作为语言说明重要组成部分的视角指导。我们的分析表明，在制定说明方面似乎已经形成了一些标准，但我们也发现了一些反复出现的问题，这些问题可能会妨碍说明的可读性和可理解性，从而影响字幕质量。为了提高可读性，我们强调了文本结构、信息组织、排版提示的连贯使用以及语言使用清晰度的重要性。最后，结合之前的研究，我们从不同角度评估了网络来源和众包字幕数据集的编译情况，并讨论了影响数据集多样性的因素。

{"title":"Language-based machine perception: linguistic perspectives on the compilation of captioning datasets","authors":"Laura Hekanaho, Maija Hirvonen, Tuomas Virtanen","doi":"10.1093/llc/fqae029","DOIUrl":"https://doi.org/10.1093/llc/fqae029","url":null,"abstract":"Over the last decade, a plethora of training datasets have been compiled for use in language-based machine perception and in human-centered AI, alongside research regarding their compilation methods. From a primarily linguistic perspective, we add to these studies in two ways. First, we provide an overview of sixty-six training datasets used in automatic image, video, and audio captioning, examining their compilation methods with a metadata analysis. Second, we delve into the annotation process of crowdsourced datasets with an interest in understanding the linguistic factors that affect the form and content of the captions, such as contextualization and perspectivation. With a qualitative content analysis, we examine annotator instructions with a selection of eleven datasets. Drawing from various theoretical frameworks that help assess the effectiveness of the instructions, we discuss the visual and textual presentation of the instructions, as well as the perspective-guidance that is an essential part of the language instructions. While our analysis indicates that some standards in the formulation of instructions seem to have formed in the field, we also identified various reoccurring issues potentially hindering readability and comprehensibility of the instructions, and therefore, caption quality. To enhance readability, we emphasize the importance of text structure, organization of the information, consistent use of typographical cues, and clarity of language use. Last, engaging with previous research, we assess the compilation of both web-sourced and crowdsourced captioning datasets from various perspectives, discussing factors affecting the diversity of the datasets.","PeriodicalId":45315,"journal":{"name":"Digital Scholarship in the Humanities","volume":"44 1","pages":""},"PeriodicalIF":0.8,"publicationDate":"2024-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141508991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Who wrote the first Constitutions of Freemasonry? 共济会的第一部章程是谁写的？

IF 0.8 3区文学 0 HUMANITIES, MULTIDISCIPLINARY

Digital Scholarship in the Humanities

Pub Date : 2024-06-01 DOI: 10.1093/llc/fqae023

Róbert Péter, Alejandro Napolitano Jawerbaum

This article addresses the problematic authorship of The Constitutions of the Free-Masons (1723). Traditionally associated with James Anderson, using stylometry, we examine whether and, if so, where John T. Desaguliers, the prime mover of early English institutionalized Freemasonry, contributed to this publication. Our corpus includes writings by Anderson, Desaguliers, and two contemporary Freemasons used as distractors. The transcribed works contain texts from different genres and of varying lengths. In our methodology, we employ a wide range of robust, multivariate, unsupervised, and cross-validated supervised tests, verified through significance testing, which can hopefully contribute to the establishment of standards for historical authorship attribution. Our results suggest, in line with historical evidence, that the legendary history of the Constitutions was most likely primarily authored by Anderson. However, several of the Charges including the first one ‘Concerning God and religion’, one of the most disputed texts in the history of Freemasonry, are closer to the style of Desaguliers. The General Regulations concerning the organization of the lodges, hitherto attributed to George Payne, played a fundamental role in spreading Freemasonry worldwide. Our analyses show that the stylistic affinity of fifteen of the thirty-nine regulations has a pronounced closeness to Anderson’s style, five align more closely with Desaguliers’ style. The authorship of the rest remains inconclusive partly due to the insufficient length of texts by Payne. These novel findings are also supported by a close reading of the Constitutions and other contemporary primary sources.

本文探讨了《共济会章程》（1723 年）的作者问题。传统上，《共济会章程》的作者是詹姆斯-安德森（James Anderson），我们利用文体测量法，研究了英国早期共济会制度化的主要推动者约翰-T-德萨古利尔（John T. Desaguliers）是否为该出版物做出了贡献，以及如果是，他在哪里做出了贡献。我们的语料库包括安德森、德萨古里埃和两位当代共济会员的著作。转录的作品包含不同体裁和不同长度的文本。在我们的研究方法中，我们采用了一系列稳健、多元、无监督和交叉验证的监督测试，并通过显著性测试进行了验证，希望这些测试能为建立历史作者归属标准做出贡献。我们的研究结果表明，根据历史证据，《宪法》的传奇历史很可能主要由安德森撰写。然而，包括第一条 "关于上帝和宗教"（共济会历史上最有争议的文本之一）在内的几条训令更接近德萨古里埃的风格。迄今为止，乔治-佩恩（George Payne）所编写的有关组织结社的《总条例》在向全世界传播共济会方面发挥了重要作用。我们的分析表明，在 39 份条例中，有 15 份在风格上明显接近安德森的风格，有 5 份更接近德萨古里埃的风格。其余条例的作者仍然没有定论，部分原因是佩恩的文本篇幅不够。这些新发现也得到了《宪法》和其他当代原始资料的细读支持。

{"title":"Who wrote the first Constitutions of Freemasonry?","authors":"Róbert Péter, Alejandro Napolitano Jawerbaum","doi":"10.1093/llc/fqae023","DOIUrl":"https://doi.org/10.1093/llc/fqae023","url":null,"abstract":"This article addresses the problematic authorship of The Constitutions of the Free-Masons (1723). Traditionally associated with James Anderson, using stylometry, we examine whether and, if so, where John T. Desaguliers, the prime mover of early English institutionalized Freemasonry, contributed to this publication. Our corpus includes writings by Anderson, Desaguliers, and two contemporary Freemasons used as distractors. The transcribed works contain texts from different genres and of varying lengths. In our methodology, we employ a wide range of robust, multivariate, unsupervised, and cross-validated supervised tests, verified through significance testing, which can hopefully contribute to the establishment of standards for historical authorship attribution. Our results suggest, in line with historical evidence, that the legendary history of the Constitutions was most likely primarily authored by Anderson. However, several of the Charges including the first one ‘Concerning God and religion’, one of the most disputed texts in the history of Freemasonry, are closer to the style of Desaguliers. The General Regulations concerning the organization of the lodges, hitherto attributed to George Payne, played a fundamental role in spreading Freemasonry worldwide. Our analyses show that the stylistic affinity of fifteen of the thirty-nine regulations has a pronounced closeness to Anderson’s style, five align more closely with Desaguliers’ style. The authorship of the rest remains inconclusive partly due to the insufficient length of texts by Payne. These novel findings are also supported by a close reading of the Constitutions and other contemporary primary sources.","PeriodicalId":45315,"journal":{"name":"Digital Scholarship in the Humanities","volume":"79 1","pages":""},"PeriodicalIF":0.8,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141191219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Parameterization of manipulative media discourse: possibilities and problems of automatic diagnosis 操纵性媒体话语的参数化：自动诊断的可能性和问题

IF 0.8 3区文学 0 HUMANITIES, MULTIDISCIPLINARY

Digital Scholarship in the Humanities

Pub Date : 2024-05-08 DOI: 10.1093/llc/fqae024

Maigul Shakenova, Dybys Tashimkhanova, Gulvira Shaikova, Ulzhan Ospanova, Olga Popovich

The issue of quantitative measurement and automatic processing is a significant problem in determining the markers of the manipulative potential of media texts, since linguistic indicators are the basis of machine parameterization. The purpose of the research is to analyse the possibilities of the main language parameters of the manipulativeness of media discourse, which can be identified using machine learning. To achieve the research goals, the following methods were used: system, content analysis, computer modelling, and comparative. The results of the article determined that such language indicators as use of the subjunctive mood of verbs, capital letters, high frequency of use of the ‘not’ particle, punctuation marks, questions, or exclamations of a rhetorical nature, use of quotation marks for the purpose of irony, double negative sentences, use of the word ‘no’, and verbal structures calling to action act as computer classification parameters. In order to cover the above purpose, PYTHON software was implemented that allowed texts to be analysed and visualized in algorithmic and lexical-vocabulary ways. In addition, it was determined that by integrating the PYTHON tool, it became possible to use language transformation markers that formed linguistic patterns in the analysed text. The list of parameters for diagnosing manipulative texts is non-exhaustive, which emphasizes the possibility of machine measurement of the manipulative component of mass media discourse.

定量测量和自动处理问题是确定媒体文本操纵潜力标记的一个重要问题，因为语言指标是机器参数化的基础。本研究的目的是分析媒体话语操纵性主要语言参数的可能性，这些参数可以通过机器学习来确定。为实现研究目标，使用了以下方法：系统、内容分析、计算机建模和比较。文章的研究结果确定，动词从句语气的使用、大写字母、"不 "的高频率使用、标点符号、疑问句或具有修辞性质的感叹句、为反讽目的而使用引号、双重否定句、"不 "的使用以及号召行动的言语结构等语言指标可作为计算机分类参数。为了达到上述目的，我们使用了PYTHON软件，该软件允许以算法和词汇的方式对文本进行分析和可视化。此外，通过整合 "PYTHON "工具，还可以使用语言转换标记，在分析文本中形成语言模式。用于诊断操纵性文本的参数清单并非详尽无遗，这强调了对大众媒体话语中的操纵性成分进行机器测量的可能性。

{"title":"Parameterization of manipulative media discourse: possibilities and problems of automatic diagnosis","authors":"Maigul Shakenova, Dybys Tashimkhanova, Gulvira Shaikova, Ulzhan Ospanova, Olga Popovich","doi":"10.1093/llc/fqae024","DOIUrl":"https://doi.org/10.1093/llc/fqae024","url":null,"abstract":"The issue of quantitative measurement and automatic processing is a significant problem in determining the markers of the manipulative potential of media texts, since linguistic indicators are the basis of machine parameterization. The purpose of the research is to analyse the possibilities of the main language parameters of the manipulativeness of media discourse, which can be identified using machine learning. To achieve the research goals, the following methods were used: system, content analysis, computer modelling, and comparative. The results of the article determined that such language indicators as use of the subjunctive mood of verbs, capital letters, high frequency of use of the ‘not’ particle, punctuation marks, questions, or exclamations of a rhetorical nature, use of quotation marks for the purpose of irony, double negative sentences, use of the word ‘no’, and verbal structures calling to action act as computer classification parameters. In order to cover the above purpose, PYTHON software was implemented that allowed texts to be analysed and visualized in algorithmic and lexical-vocabulary ways. In addition, it was determined that by integrating the PYTHON tool, it became possible to use language transformation markers that formed linguistic patterns in the analysed text. The list of parameters for diagnosing manipulative texts is non-exhaustive, which emphasizes the possibility of machine measurement of the manipulative component of mass media discourse.","PeriodicalId":45315,"journal":{"name":"Digital Scholarship in the Humanities","volume":"46 1","pages":""},"PeriodicalIF":0.8,"publicationDate":"2024-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140925420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A statistical approach to Hollywood remake and sequel metadata 好莱坞翻拍和续集元数据统计方法

IF 0.8 3区文学 0 HUMANITIES, MULTIDISCIPLINARY

Digital Scholarship in the Humanities

Pub Date : 2024-05-02 DOI: 10.1093/llc/fqae012

Agata Hołobut, Jan Rybicki, Miłosz Stelmach

Hollywood film remakes, as old as the cinema itself, have attracted much professional, critical, and academic attention. They have been viewed by art critics as products of cultural derivativity and imperialism and commended by financial experts as low-risk business investments, closely linked to other forms of brand extension, such as sequels and bestseller adaptations. In this article, we adopt a film-historical quantitative approach to Hollywood film remakes by analysing metadata obtained from the Internet Movie Database (IMDb) and verified against reliable print and web sources. We analyse 986 Hollywood remakes produced between 1915 and 2020 in terms of raw and relative frequencies of annual releases, genre (in)stability, and patterns of transnational reproduction. We contrast our findings with those outlined by Henderson (2014a) in his statistical survey of Hollywood sequels, series films, prequels, and spin-offs, presented in his monograph The Hollywood Sequel: History and Form, 1911–2010. Having completed his list with recent sequential productions released between 2011 and 2020, we investigate the potential parallels between Hollywood remaking and sequelization practices. Our findings demonstrate historical discrepancies in various ‘content recycling’ trends, which help better characterize the cultural and commercial significance of remakes and serial forms in the American film industry.

好莱坞电影翻拍与电影本身一样历史悠久，吸引了众多专业、评论界和学术界的关注。艺术评论家认为翻拍是文化衍生和帝国主义的产物，而金融专家则称赞翻拍是低风险的商业投资，与其他形式的品牌延伸（如续集和畅销书改编）密切相关。在本文中，我们通过分析从互联网电影数据库（IMDb）中获取的元数据，并与可靠的印刷品和网络资料进行核对，对好莱坞电影翻拍采用了电影史定量方法。我们分析了 1915 年至 2020 年期间制作的 986 部好莱坞翻拍影片，从每年发行的原始频率和相对频率、类型（不）稳定性以及跨国复制模式等方面进行了分析。我们将研究结果与亨德森（2014a）在其专著《好莱坞续集》（The Hollywood Sequel）中对好莱坞续集、系列电影、前传和衍生作品的统计调查结果进行对比：历史与形式，1911-2010 年》。我们在完成了他的清单后，结合 2011 年至 2020 年间上映的近期续集作品，研究了好莱坞重拍和续集化做法之间的潜在相似之处。我们的研究结果表明了各种 "内容循环 "趋势的历史差异，这有助于更好地描述翻拍和连续剧形式在美国电影业中的文化和商业意义。

{"title":"A statistical approach to Hollywood remake and sequel metadata","authors":"Agata Hołobut, Jan Rybicki, Miłosz Stelmach","doi":"10.1093/llc/fqae012","DOIUrl":"https://doi.org/10.1093/llc/fqae012","url":null,"abstract":"Hollywood film remakes, as old as the cinema itself, have attracted much professional, critical, and academic attention. They have been viewed by art critics as products of cultural derivativity and imperialism and commended by financial experts as low-risk business investments, closely linked to other forms of brand extension, such as sequels and bestseller adaptations. In this article, we adopt a film-historical quantitative approach to Hollywood film remakes by analysing metadata obtained from the Internet Movie Database (IMDb) and verified against reliable print and web sources. We analyse 986 Hollywood remakes produced between 1915 and 2020 in terms of raw and relative frequencies of annual releases, genre (in)stability, and patterns of transnational reproduction. We contrast our findings with those outlined by Henderson (2014a) in his statistical survey of Hollywood sequels, series films, prequels, and spin-offs, presented in his monograph The Hollywood Sequel: History and Form, 1911–2010. Having completed his list with recent sequential productions released between 2011 and 2020, we investigate the potential parallels between Hollywood remaking and sequelization practices. Our findings demonstrate historical discrepancies in various ‘content recycling’ trends, which help better characterize the cultural and commercial significance of remakes and serial forms in the American film industry.","PeriodicalId":45315,"journal":{"name":"Digital Scholarship in the Humanities","volume":"22 1","pages":""},"PeriodicalIF":0.8,"publicationDate":"2024-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140829422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Gender-specific features in contemporary Japanese names 当代日本姓名中的性别特征

IF 0.8 3区文学 0 HUMANITIES, MULTIDISCIPLINARY

Digital Scholarship in the Humanities

Pub Date : 2024-05-01 DOI: 10.1093/llc/fqae022

Ivona Barešová, Tereza Nakaya, Vladimír Matlach

Contemporary Japanese given names exhibit great variety and have minimal formal restrictions in their formation. It is often possible, however, to determine the gender of the name's bearer from its phonological and/or graphic form. In this article, various features, including name length, syllables, and characters at particular positions within a name and the choice of script, are statistically analyzed to determine whether they are significantly associated with male or female names and which of them contribute the most to the expression of gender. The findings of this study verify the empirical knowledge of the gender-markedness of some of the features and establish a solid foundation for future feature-based gender prediction algorithms. The expression of gender in currently bestowed names is discussed in the context of major changes in naming practices and name choices toward the end of the 20th century.

当代日本人的姓氏种类繁多，在形式上的限制极少。不过，通常可以从名字的语音和/或图形形式来确定名字持有人的性别。本文对各种特征进行了统计分析，包括名字的长度、音节、名字中特定位置的字符以及文字的选择，以确定它们是否与男性或女性名字有显著关联，以及哪种特征最有助于表达性别。这项研究的结果验证了一些特征的性别标记经验知识，并为未来基于特征的性别预测算法奠定了坚实的基础。本研究结合 20 世纪末命名实践和姓名选择的重大变化，讨论了目前所赐姓名中的性别表达。

引用次数: 0

Explaining the spatial segregation of ethnic groups in an early industrial city: the case of Vyborg 解释早期工业城市的族群空间分隔：维堡案例

IF 0.8 3区文学 0 HUMANITIES, MULTIDISCIPLINARY

Digital Scholarship in the Humanities

Pub Date : 2024-04-27 DOI: 10.1093/llc/fqae017

Antti Härkönen

An early industrial town’s spatial segregation is studied using empirical data concerning the Russian population of the town of Vyborg. Several hypotheses for explaining segregation are considered using spatial analysis. The spatial data are derived from historical maps and demographic data from various tax records. Socioeconomic segregation is studied as a possible cause of ethnic segregation. The main drivers of spatial segregation were the explicit policies of segregation enforced by both the Russian military administration and the town’s civilian administration. While the effects of segregation gradually diminished due to social diffusion, the impact of policy decisions driving segregation in the 18th and early 19th centuries was still visible in the population’s later 19th-century segregation. Yet neither the different preferences of Russians and others nor the income differences between areas explains the distribution of Russians. Segregation based on the membership of a guild was insignificant, with a few exceptions. Other factors such as discrimination, prejudice, and differences in housing market information probably contributed to segregation, but they cannot be studied with the data used.

本文利用有关维堡镇俄罗斯人口的经验数据，研究了一个早期工业城镇的空间隔离问题。通过空间分析，考虑了几种解释空间隔离的假设。空间数据来自历史地图和各种税收记录中的人口数据。研究将社会经济隔离作为种族隔离的可能原因。空间隔离的主要驱动因素是俄罗斯军事管理部门和城镇民政管理部门明确执行的隔离政策。虽然由于社会扩散，隔离的影响逐渐减弱，但 18 世纪和 19 世纪初推动隔离的政策决定的影响在 19 世纪后期的人口隔离中仍然可见。然而，无论是俄罗斯人和其他人的不同偏好，还是地区之间的收入差异，都无法解释俄罗斯人的分布情况。除少数例外情况外，基于行会成员身份的隔离并不明显。其他因素，如歧视、偏见和住房市场信息的差异，可能也是造成隔离的原因之一，但使用的数据无法对这些因素进行研究。

{"title":"Explaining the spatial segregation of ethnic groups in an early industrial city: the case of Vyborg","authors":"Antti Härkönen","doi":"10.1093/llc/fqae017","DOIUrl":"https://doi.org/10.1093/llc/fqae017","url":null,"abstract":"An early industrial town’s spatial segregation is studied using empirical data concerning the Russian population of the town of Vyborg. Several hypotheses for explaining segregation are considered using spatial analysis. The spatial data are derived from historical maps and demographic data from various tax records. Socioeconomic segregation is studied as a possible cause of ethnic segregation. The main drivers of spatial segregation were the explicit policies of segregation enforced by both the Russian military administration and the town’s civilian administration. While the effects of segregation gradually diminished due to social diffusion, the impact of policy decisions driving segregation in the 18th and early 19th centuries was still visible in the population’s later 19th-century segregation. Yet neither the different preferences of Russians and others nor the income differences between areas explains the distribution of Russians. Segregation based on the membership of a guild was insignificant, with a few exceptions. Other factors such as discrimination, prejudice, and differences in housing market information probably contributed to segregation, but they cannot be studied with the data used.","PeriodicalId":45315,"journal":{"name":"Digital Scholarship in the Humanities","volume":"75 1","pages":""},"PeriodicalIF":0.8,"publicationDate":"2024-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140809236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Is medieval distant viewing possible? : Extending and enriching annotation of legacy image collections using visual analytics 中世纪远观是否可能：利用视觉分析扩展和丰富传统图像收藏的注释内容

IF 0.8 3区文学 0 HUMANITIES, MULTIDISCIPLINARY

Digital Scholarship in the Humanities

Pub Date : 2024-04-24 DOI: 10.1093/llc/fqae020

Christofer Meinecke, Estelle Guéville, David Joseph Wrisley, Stefan Jänicke

Distant viewing approaches have typically used image datasets close to the contemporary image data used to train machine learning models. To work with images from other historical periods requires expert annotated data, and the quality of labels is crucial for the quality of results. Especially when working with cultural heritage collections that contain myriad uncertainties, annotating data, or re-annotating, legacy data is an arduous task. In this paper, we describe working with two pre-annotated sets of medieval manuscript images that exhibit conflicting and overlapping metadata. Since a manual reconciliation of the two legacy ontologies would be very expensive, we aim (1) to create a more uniform set of descriptive labels to serve as a “bridge” in the combined dataset, and (2) to establish a high-quality hierarchical classification that can be used as a valuable input for subsequent supervised machine learning. To achieve these goals, we developed visualization and interaction mechanisms, enabling medievalists to combine, regularize and extend the vocabulary used to describe these, and other cognate, image datasets. The visual interfaces provide experts an overview of relationships in the data going beyond the sum total of the metadata. Word and image embeddings as well as co-occurrences of labels across the datasets enable batch re-annotation of images, recommendation of label candidates, and support composing a hierarchical classification of labels.

远观方法通常使用与当代图像数据接近的图像数据集来训练机器学习模型。要处理其他历史时期的图像，需要专家注释数据，而标签的质量对结果的质量至关重要。尤其是在处理包含无数不确定因素的文化遗产藏品时，对数据进行注释或重新注释遗留数据是一项艰巨的任务。在本文中，我们介绍了如何处理两套预先标注的中世纪手稿图像，这两套图像的元数据存在冲突和重叠。由于手动调节两个遗留本体论的成本非常高昂，我们的目标是：(1) 创建一套更统一的描述性标签，作为合并数据集的 "桥梁"；(2) 建立高质量的分层分类，作为后续监督机器学习的宝贵输入。为了实现这些目标，我们开发了可视化和交互机制，使中世纪学者能够组合、规范和扩展用于描述这些以及其他同类图像数据集的词汇。可视化界面为专家们提供了超越元数据总和的数据关系概览。单词和图像嵌入以及数据集中标签的共现可以实现图像的批量重新标注、推荐候选标签，并支持对标签进行分层分类。

{"title":"Is medieval distant viewing possible? : Extending and enriching annotation of legacy image collections using visual analytics","authors":"Christofer Meinecke, Estelle Guéville, David Joseph Wrisley, Stefan Jänicke","doi":"10.1093/llc/fqae020","DOIUrl":"https://doi.org/10.1093/llc/fqae020","url":null,"abstract":"Distant viewing approaches have typically used image datasets close to the contemporary image data used to train machine learning models. To work with images from other historical periods requires expert annotated data, and the quality of labels is crucial for the quality of results. Especially when working with cultural heritage collections that contain myriad uncertainties, annotating data, or re-annotating, legacy data is an arduous task. In this paper, we describe working with two pre-annotated sets of medieval manuscript images that exhibit conflicting and overlapping metadata. Since a manual reconciliation of the two legacy ontologies would be very expensive, we aim (1) to create a more uniform set of descriptive labels to serve as a “bridge” in the combined dataset, and (2) to establish a high-quality hierarchical classification that can be used as a valuable input for subsequent supervised machine learning. To achieve these goals, we developed visualization and interaction mechanisms, enabling medievalists to combine, regularize and extend the vocabulary used to describe these, and other cognate, image datasets. The visual interfaces provide experts an overview of relationships in the data going beyond the sum total of the metadata. Word and image embeddings as well as co-occurrences of labels across the datasets enable batch re-annotation of images, recommendation of label candidates, and support composing a hierarchical classification of labels.","PeriodicalId":45315,"journal":{"name":"Digital Scholarship in the Humanities","volume":"2015 1","pages":""},"PeriodicalIF":0.8,"publicationDate":"2024-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140801375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0