Eye Tracking in Linguistics. Salvatore Attardo and Lucy Pickering. London: Bloomsbury Academic, 2023, 304 pp., £28.99 (paperback). ISBN 978-1-3501-1751-8. Reviewed by Caterina Cacioli (Lettere e Filosofia, Università degli Studi di Firenze, Italy; caterina.cacioli@unifi.it; ORCID 0000-0002-9994-5770). Digital Scholarship in the Humanities, fqad081, https://doi.org/10.1093/llc/fqad081. Published: 3 November 2023.
Methodological observations concerning word rankings and z-score refinements. Hartmut Ilsemann. Digital Scholarship in the Humanities, fqad079, https://doi.org/10.1093/llc/fqad079. Published: 3 November 2023.
Abstract: This article evaluates the word rankings proposed by Ary L. Goldberger, Albert C. Yang, and C. Peng as a means of establishing the authorship of texts, in the light of Delta, developed by John Burrows at about the same time. Tests carried out with high-ranking function words, together with results established with the more modern approaches of Rolling Delta, Rolling Classify, and the General Imposters method, give clear evidence that word rankings return only crude and unreliable results that cannot keep up with non-traditional modern methods. Although the stylistic difference between Marlowe's and Shakespeare's plays could be stated, word rankings failed to recognize Shakespearean stylistics in The Jew of Malta, Edward II, and Doctor Faustus. Only through the use of z-scores did a wider vocabulary provide a greater degree of differentiation.
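The z-score method the abstract refers to is Burrows's Delta. As a hedged sketch (the function-word rates below are invented for illustration and are not the article's data), the core computation fits in a few lines:

```python
# Burrows's Delta on a toy corpus: per-1,000-word rates of three function
# words in four reference texts plus one disputed text. All numbers are
# invented; only the procedure is real.
from statistics import mean, stdev

features = ["the", "and", "of"]
corpus = {                        # text -> per-1,000-word rates
    "Marlowe-1":     [62.1, 28.4, 30.2],
    "Marlowe-2":     [60.8, 27.9, 31.0],
    "Shakespeare-1": [55.3, 33.6, 26.1],
    "Shakespeare-2": [54.7, 34.2, 25.5],
}
disputed = [61.5, 28.1, 30.6]

# z-score each feature against the whole reference corpus
mus = [mean(col) for col in zip(*corpus.values())]
sds = [stdev(col) for col in zip(*corpus.values())]

def z(rates):
    return [(r - m) / s for r, m, s in zip(rates, mus, sds)]

# Delta = mean absolute difference between z-score profiles
zd = z(disputed)
deltas = {name: mean(abs(a - b) for a, b in zip(z(rates), zd))
          for name, rates in corpus.items()}
closest = min(deltas, key=deltas.get)   # smallest Delta = closest style
```

With these invented rates the disputed text lands nearest the Marlowe profiles, which is the kind of discrimination the article reports that raw word rankings fail to deliver.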
Research Methods for Digital Discourse Analysis. Camilla Vásquez. London: Bloomsbury Academic Press, 2022, 330 pp., $108.00 (hardback). ISBN 978-1-350-16683-7. Reviewed by Hang Yu (School of Foreign Studies, Northwestern Polytechnical University, China; uibeyuhang@163.com) and Sijia Chang (School of Foreign Studies, Northwestern Polytechnical University, China). Digital Scholarship in the Humanities, fqad082, https://doi.org/10.1093/llc/fqad082. Published: 2 November 2023.
The Victorian anti-vaccination discourse corpus (VicVaDis): construction and exploration. Claire Hardaker, Alice Deignan, Elena Semino, Tara Coltman-Patel, William Dance, Zsófia Demjén, Chris Sanderson, Derek Gatherer. Digital Scholarship in the Humanities, fqad075, https://doi.org/10.1093/llc/fqad075. Published: 26 October 2023.
Abstract: This article introduces and explores the 3.5-million-word Victorian Anti-Vaccination Discourse Corpus (VicVaDis). The corpus is intended to provide a (freely accessible) historical resource for the investigation of the earliest public concerns and arguments against vaccination in England, which revolved around compulsory vaccination against smallpox in the second half of the 19th century. It consists of 133 anti-vaccination pamphlets and publications gathered from 1854 to 1906, a span of 53 years that loosely coincides with the Victorian era (1837–1901). This timeframe was chosen to capture the period between the 1853 Vaccination Act, which made smallpox vaccination for babies compulsory, and the 1907 Act that effectively ended the mandatory nature of vaccination. After an overview of the historical background, this article describes the rationale, design, and construction of the corpus, and then demonstrates how it can be exploited to investigate the main arguments against compulsory vaccination by means of widely accessible corpus linguistic tools. Where appropriate, parallels are drawn between Victorian and 21st-century vaccine-hesitant attitudes and arguments. Overall, this article demonstrates the potential of corpus analysis to add to our understanding of historical concerns about vaccination.
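The "widely accessible corpus linguistic tools" mentioned above typically surface the main arguments by ranking keywords with a keyness statistic. As a hedged illustration (the counts below are invented, not drawn from VicVaDis), Dunning's log-likelihood (G2), the statistic behind many keyword lists, can be computed directly:

```python
# Keyness of a single word via Dunning's log-likelihood (G2).
# Invented counts: occurrences of a word in a 3.5M-word target corpus
# versus a same-sized reference corpus.
import math

a, target_size = 120, 3_500_000   # word count, target corpus size
b, ref_size    = 40,  3_500_000   # word count, reference corpus size

# Expected counts under the null hypothesis of equal rates
e1 = target_size * (a + b) / (target_size + ref_size)
e2 = ref_size * (a + b) / (target_size + ref_size)

# G2 > 3.84 is conventionally taken as significant at p < 0.05 (1 d.f.)
g2 = 2 * (a * math.log(a / e1) + b * math.log(b / e2))
```

Words ranked by G2 against a general reference corpus would foreground the vocabulary characteristic of the anti-vaccination pamphlets.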
The combination of the Song Dynasty patterns and digital technology. Jia Hu. Digital Scholarship in the Humanities, fqad077, https://doi.org/10.1093/llc/fqad077. Published: 26 October 2023.
Abstract: Song-era artifacts carry ideas of naturalness and moderate elegance, to be preserved and passed on to future generations. Emerging digital opportunities and initiatives allow us not only to study pattern samples in detail but also to create digital equivalents that are not subject to aging or destruction. This research aims to obtain specific knowledge and situational experience regarding the digitization of Song Dynasty patterns. At the preliminary research stage, the authors performed an online survey on a popular Chinese platform. The survey confirmed the interest of a wide audience in cultural heritage (CH) objects and their preservation. The respondents reported a preference for digital sources of information and considered the documentary video format an optimal educational design. In the first stage, Song patterns were digitally processed using MATLAB version 7.0 and ACDSee. In the second stage, based on the processed patterns, the authors developed an instructional video. In the third stage, the researchers assessed the knowledge students acquired by watching the video (experimental group) or attending a lecture with a presentation based on raw patterns (control group). The results obtained before and after the intervention showed significant progress in each group, indicating the effectiveness of the digitized images and of this intervention for acquiring new knowledge and raising awareness of the Song Dynasty CH.
Machine learning and data analysis for word segmentation of classical Chinese poems: illustrations with Tang and Song examples. Chao-Lin Liu, Wei-Ting Chang, Chang-Ting Chu, Ti-Yong Zheng. Digital Scholarship in the Humanities, fqad073, https://doi.org/10.1093/llc/fqad073. Published: 20 October 2023.
Abstract: Words are essential for understanding classical Chinese poems. We report a collection of 32,399 classical Chinese poems annotated with word boundaries. Statistics about the annotated poems support several heuristic observations that researchers of Chinese literature discuss in the literature, including the patterns of lines and a practice for parallel structures (對仗). The annotators were affiliated with two universities, so that they could annotate the poems as independently as possible. Results of an inter-rater agreement study indicate that the annotators agree on the identified words 93 per cent of the time and agree perfectly on the segmentation of a whole poem 42 per cent of the time. We applied unsupervised classification methods to annotate the poems in several different settings and evaluated the results against the human annotations. Under favorable conditions, the classifier identified about 88 per cent of the words and segmented poems perfectly 22 per cent of the time.
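The two agreement figures (agreement on individual words versus perfect agreement on a whole poem) can be illustrated with a toy span-based measure. The paper's exact metric is not specified here, so the formulation below is one plausible reading, and the example segmentations are invented:

```python
# Word-level agreement between two segmentations of the same line,
# measured over character spans: a word "matches" if both annotators
# produce it at the same character positions.
def spans(segmentation):
    out, i = set(), 0
    for word in segmentation:
        out.add((i, i + len(word)))
        i += len(word)
    return out

def word_agreement(seg_a, seg_b):
    sa, sb = spans(seg_a), spans(seg_b)
    return len(sa & sb) / max(len(sa), len(sb))

# Invented example: two annotators segment the same five-character line.
ann_a = ["床前", "明月", "光"]
ann_b = ["床前", "明月光"]
partial = word_agreement(ann_a, ann_b)   # agree only on 床前
perfect = word_agreement(ann_a, ann_a)   # identical segmentations
```

Averaging `word_agreement` over many lines gives a per-word figure like the reported 93 per cent, while the share of poems with `perfect == 1.0` throughout corresponds to the 42 per cent whole-poem figure.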
Research on character tone trend clustering of Kunqu Opera based on quantum adaptive genetic algorithm. Rui Tian, Ruheng Yin, Junrong Ban. Digital Scholarship in the Humanities, fqad074, https://doi.org/10.1093/llc/fqad074. Published: 19 October 2023.
Abstract: Kunqu, one of the oldest forms of Chinese opera, features a unique artistic expression arising from the interplay between vocal melody and the tonal quality of its lyrics. Identifying Kunqu's character tone trend (vocal melodies derived from the tonal quality of the lyrics) is critical to understanding and preserving this art form. Traditional research methods, which rely on qualitative descriptions by musicologists, have often been debated due to their subjective nature. In this study, we present a novel approach to analyzing the character tone trend in Kunqu using computer modeling and machine learning techniques. By extracting the character tone trend with computational modeling methods and applying cluster analysis to Kunqu's character tone melodies, our model uncovers musical structural patterns between singing and speech, validating and refining the qualitative findings of musicologists. Furthermore, our model can automatically assess whether a piece adheres to the rhythmic norms of 'the integration of literature and music' in Kunqu, thus contributing to the digitization, creation, and preservation of this important cultural heritage.
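The clustering step alone can be sketched in miniature. The contours below are invented four-point pitch sequences, and plain k-means stands in for the paper's quantum adaptive genetic algorithm, which is a more elaborate optimizer for a similar grouping objective; this is not the authors' method:

```python
# Grouping character tone contours (invented 4-point pitch sequences)
# with a minimal k-means, as a stand-in for the paper's optimizer.
from statistics import mean

contours = [
    [1, 2, 3, 4], [1, 2, 4, 4],   # rising-like contours
    [4, 3, 2, 1], [4, 4, 2, 1],   # falling-like contours
]

def dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(points, centroids, iters=10):
    for _ in range(iters):
        groups = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: dist(p, centroids[i]))
            groups[nearest].append(p)
        # move each centroid to the mean of its group (keep it if empty)
        centroids = [[mean(col) for col in zip(*g)] if g else c
                     for g, c in zip(groups, centroids)]
    return centroids, groups

cents, groups = kmeans(contours, [contours[0], contours[2]])
```

Under these toy inputs the rising and falling contours separate into the two clusters, which is the kind of structural pattern the study compares between singing and speech.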
One-third of a century on: the state of the art, pitfalls, and the way ahead relating to digital humanities approaches to translation and interpreting studies. Chonglong Gu. Digital Scholarship in the Humanities, fqad076, https://doi.org/10.1093/llc/fqad076. Published: 18 October 2023.
Abstract: The year 1993 represents a momentous milestone in the not-so-long history of translation and interpreting studies (TIS). Mona Baker's foundational 1993 paper, 'Corpus linguistics and translation studies: Implications and applications', signalled a defining moment in the application of digital humanities (DH) approaches in TIS. Since then, corpus-based TIS, the most visible manifestation of DH in TIS, has come into being and is now gradually maturing. Compared with the previously largely anecdotal, impressionistic, and prescriptivist accounts of translation and interpreting, the incorporation of DH tools (e.g. corpus linguistics) has significantly enriched TIS with new perspectives, making it possible for researchers to explore the various aspects of translation and interpreting in a more objective and systematic way, drawing on real-world data. Now that one-third of a century has passed since the publication of Baker's seminal paper, DH-inspired studies of translation and interpreting are in full swing. As we reach the 30-year mark of this influential publication, it is important to take stock of previous achievements and look to the future both with pride and with a cool head. In this article, we trace the development of DH approaches to TIS and present the state of the art, before discussing some of the limitations and pitfalls and the road ahead.
Who could be behind QAnon? Authorship attribution with supervised machine-learning. Florian Cafiero, Jean-Baptiste Camps. Digital Scholarship in the Humanities, fqad061, https://doi.org/10.1093/llc/fqad061. Published: 18 October 2023.
Abstract: A series of social media posts on 4chan and then 8chan, signed under the pseudonym 'Q', started a movement known as QAnon, which led some of its most radical supporters to violent and illegal actions. To identify the person(s) behind Q, we evaluate the coincidence between the linguistic properties of the texts written by Q and those written by a list of suspects provided by journalistic investigation. Identifying the authors of these posts poses serious challenges. The 'Q drops' are very short texts, written in a way that constitutes a sort of literary genre in itself, with very peculiar features of style. These texts might have been written by different authors, whose other writings are often hard to find. After an online ethnography of the movement, necessary to collect enough material written by the thirteen potential authors, we use supervised machine learning to build stylistic profiles for each of them. We then performed a 'rolling analysis', looking repeatedly through a moving window for parts of Q's writings matching our profiles. We conclude that two different individuals, Paul F. and Ron W., are the closest matches to Q's linguistic signature, and that they could have successively written Q's texts. These potential authors are not high-ranking personalities from the US administration, but rather social media activists.
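The 'rolling analysis' idea can be sketched generically: slide a window over the questioned text and, in each window, score every candidate profile. The profiles, texts, and similarity measure below are invented placeholders; the authors' actual pipeline used supervised machine learning over richer stylistic features:

```python
# Rolling authorship attribution sketch: for each window of the questioned
# text, pick the candidate whose stylistic profile is most similar.
from collections import Counter

def profile(text, top=5):
    """Relative frequencies of the most frequent tokens."""
    counts = Counter(text.lower().split())
    total = sum(counts.values())
    return {w: c / total for w, c in counts.most_common(top)}

def similarity(p, q):
    """Negative L1 distance between two frequency profiles."""
    keys = set(p) | set(q)
    return -sum(abs(p.get(k, 0) - q.get(k, 0)) for k in keys)

def rolling_attribution(tokens, candidates, window=20, step=10):
    out = []
    for start in range(0, max(1, len(tokens) - window + 1), step):
        chunk = " ".join(tokens[start:start + window])
        best = max(candidates,
                   key=lambda c: similarity(profile(chunk), candidates[c]))
        out.append((start, best))     # window offset -> attributed author
    return out
```

A change of best-matching candidate partway through the sequence of windows is what would suggest, as in the article's conclusion, that different individuals wrote successive stretches of the text.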
A quantitative window on the history of statistics: topic-modelling 120 years of Biometrika. Nicola Bertoldi, Francis Lareau, Charles H Pence, Christophe Malaterre. Digital Scholarship in the Humanities, fqad072, https://doi.org/10.1093/llc/fqad072. Published: 13 October 2023.
Abstract: As one of the oldest continuously publishing journals in statistics (published since 1901), Biometrika provides a unique window onto the history of statistics and its epistemic development throughout the 20th and the beginning of the 21st centuries. While the early history of the discipline, with the works of key figures such as Karl Pearson, Francis Galton, or Ronald Fisher, is relatively well known, the later (and longer) episodes of its intellectual development remain understudied. By applying digital tools to the full-text corpus of the journal articles (N = 5,596), the objective of this study is to provide a novel quantitative exploration of the history of the statistical sciences via an all-encompassing view of 120 years of Biometrika. To this aim, topic-modelling analyses are used and provide insights into the epistemic content of the journal and its evolution. Striking changes in the thematic content of the journal are documented and quantified for the first time, from the decline of Pearsonian and Weldonian biometrical research and the journal's tight connection to biology in the 1930s to the rise of modern statistical methods beginning in the 1960s and 1970s. Newly developed approaches are used to infer author networks from publication topics. The resulting network of authors shows the existence of several communities, well-aligned with topic clusters and their evolution through time. It also highlights the role of specific figures over more than a century of publishing history and provides a first window onto the foundation, development, and diverse applications of the statistical sciences.
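Once a topic model has been fitted, quantifying thematic change of the kind described above reduces to aggregating document-topic weights over time. The weights below are invented; in the study they would come from a topic model (such as LDA) fitted to the 5,596 Biometrika articles:

```python
# Tracking topic prevalence by decade from (year, doc-topic weights) pairs.
# Two invented topics: 0 = early biometrical research, 1 = modern methods.
from collections import defaultdict
from statistics import mean

docs = [
    (1920, [0.9, 0.1]), (1925, [0.8, 0.2]),   # biometry dominates early on
    (1965, [0.3, 0.7]), (1975, [0.2, 0.8]),   # modern methods rise later
]

by_decade = defaultdict(list)
for year, weights in docs:
    by_decade[year // 10 * 10].append(weights)

# decade -> mean weight per topic
trend = {dec: [mean(col) for col in zip(*ws)]
         for dec, ws in sorted(by_decade.items())}
```

Plotting `trend` per topic over decades is how a decline (biometry) and a rise (modern statistical methods) such as those reported would be made visible.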