编译和分析大量在线讨论语料库,以探索用户交互

Shi Min CHUA
{"title":"编译和分析大量在线讨论语料库,以探索用户交互","authors":"Shi Min CHUA","doi":"10.1016/j.acorp.2022.100017","DOIUrl":null,"url":null,"abstract":"<div><p>This methodology-focused paper reports how I compiled and analysed a 12-million-word corpus of threaded online discussions by employing Corpus Workbench tool (CWB, Evert &amp; Hardie, 2011) and combining corpus analysis with micro-analysis drawing on the principles of digital Conversation Analysis. The tool not only affords an efficient retrieval and analysis of a large dataset, but also, more importantly, facilitates exploration of a corpus of online discussions based on different variables (e.g., topics of discussions, role of internet users, types of postings) and units of analysis (e.g., subforums, threads, postings). Examples are presented to illustrate how I used this tool to investigate various aspects of online discussions, and extract threads surrounding a particular topic or language practices for micro-analysis. I propose internet users’ interactions in online discussions can be further explored in the field of corpus linguistics by using this tool and a synergy of corpus linguistics and an interactional approach.</p></div>","PeriodicalId":72254,"journal":{"name":"Applied Corpus Linguistics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266679912200003X/pdfft?md5=bc9ad1325dae08c713ab8180e4a3e150&pid=1-s2.0-S266679912200003X-main.pdf","citationCount":"2","resultStr":"{\"title\":\"Compiling and analysing a large corpus of online discussions to explore users’ interactions\",\"authors\":\"Shi Min CHUA\",\"doi\":\"10.1016/j.acorp.2022.100017\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>This methodology-focused paper reports how I compiled and analysed a 12-million-word corpus of threaded online discussions by employing Corpus Workbench tool (CWB, Evert &amp; Hardie, 2011) and combining corpus analysis with micro-analysis drawing on the principles of digital Conversation Analysis. The tool not only affords an efficient retrieval and analysis of a large dataset, but also, more importantly, facilitates exploration of a corpus of online discussions based on different variables (e.g., topics of discussions, role of internet users, types of postings) and units of analysis (e.g., subforums, threads, postings). Examples are presented to illustrate how I used this tool to investigate various aspects of online discussions, and extract threads surrounding a particular topic or language practices for micro-analysis. I propose internet users’ interactions in online discussions can be further explored in the field of corpus linguistics by using this tool and a synergy of corpus linguistics and an interactional approach.</p></div>\",\"PeriodicalId\":72254,\"journal\":{\"name\":\"Applied Corpus Linguistics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S266679912200003X/pdfft?md5=bc9ad1325dae08c713ab8180e4a3e150&pid=1-s2.0-S266679912200003X-main.pdf\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Corpus Linguistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S266679912200003X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Corpus Linguistics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S266679912200003X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

这篇以方法为重点的论文报告了我是如何利用语料库工作台工具(CWB, Evert &Hardie, 2011),并根据数字会话分析的原则,将语料库分析与微观分析相结合。该工具不仅提供了对大型数据集的有效检索和分析,而且更重要的是,它促进了基于不同变量(例如,讨论主题、互联网用户角色、帖子类型)和分析单元(例如,子论坛、线程、帖子)的在线讨论语料库的探索。本文提供的示例说明了我如何使用该工具调查在线讨论的各个方面,并提取围绕特定主题或语言实践的线索进行微观分析。我建议利用这一工具和语料库语言学与互动方法的协同作用,在语料库语言学领域进一步探索互联网用户在在线讨论中的互动。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Compiling and analysing a large corpus of online discussions to explore users’ interactions

This methodology-focused paper reports how I compiled and analysed a 12-million-word corpus of threaded online discussions by employing Corpus Workbench tool (CWB, Evert & Hardie, 2011) and combining corpus analysis with micro-analysis drawing on the principles of digital Conversation Analysis. The tool not only affords an efficient retrieval and analysis of a large dataset, but also, more importantly, facilitates exploration of a corpus of online discussions based on different variables (e.g., topics of discussions, role of internet users, types of postings) and units of analysis (e.g., subforums, threads, postings). Examples are presented to illustrate how I used this tool to investigate various aspects of online discussions, and extract threads surrounding a particular topic or language practices for micro-analysis. I propose internet users’ interactions in online discussions can be further explored in the field of corpus linguistics by using this tool and a synergy of corpus linguistics and an interactional approach.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Applied Corpus Linguistics
Applied Corpus Linguistics Linguistics and Language
CiteScore
1.30
自引率
0.00%
发文量
0
审稿时长
70 days
期刊最新文献
Breach of pacta sunt servanda: A corpus-assisted analysis of newspaper discourse on the AUKUS agreement Identifying ChatGPT-generated texts in EFL students’ writing: Through comparative analysis of linguistic fingerprints English podcasts for schoolchildren and their vocabulary demands Capturing chronological variation in L2 speech through lexical measurements and regression analysis Investigating spoken classroom interactions in linguistically heterogeneous learning groups – An interdisciplinary approach to process video-based data in second language acquisition classrooms
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1