学术话语中启发式语篇实践的分类

IF 1.6 2区文学 0 LANGUAGE & LINGUISTICS International Journal of Corpus Linguistics Pub Date : 2020-11-23 DOI:10.1075/ijcl.19097.bec

Maria Becker, M. Bender, Marcus Müller

{"title":"学术话语中启发式语篇实践的分类","authors":"Maria Becker, M. Bender, Marcus Müller","doi":"10.1075/ijcl.19097.bec","DOIUrl":null,"url":null,"abstract":"In this paper, we investigate how deep learning techniques can be applied to discourse pragmatics. As a testcase we analyse heuristic textual practices, defined as linguistic implementations of decision routines in research processes in academic discourse. We develop a complex annotation scheme of pragmalinguistic categories on different levels of granularity and manually annotate a corpus of texts across various scientific disciplines. This is the basis for training recurrent neural networks to classify heuristic textual practices. Our experiments show that the annotation categories are robust enough to be recognised by our models which learn similarities of the sentence-surfaces represented as word embeddings. Our study aims at an iterative human-in-the-loop process in which manual-hermeneutic and algorithmic procedures mutually advance the insight process. It underlines the fact that the interaction between manual and automated methods opens up a promising field for further research, allowing interpretative analyses of complex pragmatic phenomena in large corpora.","PeriodicalId":46843,"journal":{"name":"International Journal of Corpus Linguistics","volume":" ","pages":""},"PeriodicalIF":1.6000,"publicationDate":"2020-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Classifying heuristic textual practices in academic discourse\",\"authors\":\"Maria Becker, M. Bender, Marcus Müller\",\"doi\":\"10.1075/ijcl.19097.bec\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we investigate how deep learning techniques can be applied to discourse pragmatics. As a testcase we analyse heuristic textual practices, defined as linguistic implementations of decision routines in research processes in academic discourse. We develop a complex annotation scheme of pragmalinguistic categories on different levels of granularity and manually annotate a corpus of texts across various scientific disciplines. This is the basis for training recurrent neural networks to classify heuristic textual practices. Our experiments show that the annotation categories are robust enough to be recognised by our models which learn similarities of the sentence-surfaces represented as word embeddings. Our study aims at an iterative human-in-the-loop process in which manual-hermeneutic and algorithmic procedures mutually advance the insight process. It underlines the fact that the interaction between manual and automated methods opens up a promising field for further research, allowing interpretative analyses of complex pragmatic phenomena in large corpora.\",\"PeriodicalId\":46843,\"journal\":{\"name\":\"International Journal of Corpus Linguistics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2020-11-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Corpus Linguistics\",\"FirstCategoryId\":\"98\",\"ListUrlMain\":\"https://doi.org/10.1075/ijcl.19097.bec\",\"RegionNum\":2,\"RegionCategory\":\"文学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"0\",\"JCRName\":\"LANGUAGE & LINGUISTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Corpus Linguistics","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1075/ijcl.19097.bec","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"LANGUAGE & LINGUISTICS","Score":null,"Total":0}

引用次数: 2

摘要

在本文中，我们研究了深度学习技术如何应用于话语语用学。作为一个测试案例，我们分析了启发式语篇实践，它被定义为学术话语中研究过程中决策例程的语言实现。我们开发了一个不同粒度水平的语用语言学类别的复杂注释方案，并手动注释了不同科学学科的文本语料库。这是训练递归神经网络对启发式文本实践进行分类的基础。我们的实验表明，注释类别足够强大，可以被我们的模型识别，这些模型学习了表示为单词嵌入的句子表面的相似性。我们的研究目标是一个迭代的人在循环过程，在这个过程中，手工解释学和算法程序相互推进洞察过程。它强调了这样一个事实，即手动和自动方法之间的互动为进一步研究开辟了一个很有前途的领域，允许对大型语料库中复杂的语用现象进行解释性分析。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Classifying heuristic textual practices in academic discourse

In this paper, we investigate how deep learning techniques can be applied to discourse pragmatics. As a testcase we analyse heuristic textual practices, defined as linguistic implementations of decision routines in research processes in academic discourse. We develop a complex annotation scheme of pragmalinguistic categories on different levels of granularity and manually annotate a corpus of texts across various scientific disciplines. This is the basis for training recurrent neural networks to classify heuristic textual practices. Our experiments show that the annotation categories are robust enough to be recognised by our models which learn similarities of the sentence-surfaces represented as word embeddings. Our study aims at an iterative human-in-the-loop process in which manual-hermeneutic and algorithmic procedures mutually advance the insight process. It underlines the fact that the interaction between manual and automated methods opens up a promising field for further research, allowing interpretative analyses of complex pragmatic phenomena in large corpora.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Corpus Linguistics Multiple-

CiteScore

3.30

自引率

0.00%

发文量

期刊介绍： The International Journal of Corpus Linguistics (IJCL) publishes original research covering methodological, applied and theoretical work in any area of corpus linguistics. Through its focus on empirical language research, IJCL provides a forum for the presentation of new findings and innovative approaches in any area of linguistics (e.g. lexicology, grammar, discourse analysis, stylistics, sociolinguistics, morphology, contrastive linguistics), applied linguistics (e.g. language teaching, forensic linguistics), and translation studies. Based on its interest in corpus methodology, IJCL also invites contributions on the interface between corpus and computational linguistics.

期刊最新文献

Review of Durrant (2023): Corpus linguistics for writing development Review of Dunn (2022): Natural Language Processing for Corpus Linguistics Review of Viana (2022): Teaching English with Corpora: A Resource Book Framing the path to net zero Political framing of Covid-19