{"title":"Computer-Assisted Corpus Analysis: An Introduction to Concepts, Processes, and Decisions","authors":"Susan Lang;Duncan A. Buell;Norbert Elliot","doi":"10.1109/TPC.2022.3228026","DOIUrl":null,"url":null,"abstract":"<bold>Problem:</b>\n This tutorial aims to guide readers through key concepts, basic processes, and common decision points that inform computer-assisted corpus-based research in technical, professional, and scientific communication (TPSC). \n<bold>Key concepts:</b>\n Based on our collaborative experiences and an example developed for this tutorial, key concepts of corpus analysis useful to TPSC researchers and practitioners include the following: corpus location, text preparation, and programming language and software selection. \n<bold>Key lessons:</b>\n These key concepts can be used to establish basic processes and decision points that, in turn, yield lessons related to the usefulness of lexicogrammatical language models and the significance of multidisciplinarity. \n<bold>Implications:</b>\n Although corpus research is a growing and important part of the field of TPSC, challenges remain in terms of language model variety and ethical considerations. At least in part, these challenges can be met, respectively, by alignment between corpus and analytic tools and reference to the Common Rule and related international standards.","PeriodicalId":46950,"journal":{"name":"IEEE Transactions on Professional Communication","volume":"66 1","pages":"94-113"},"PeriodicalIF":1.6000,"publicationDate":"2023-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Professional Communication","FirstCategoryId":"98","ListUrlMain":"https://ieeexplore.ieee.org/document/10045048/","RegionNum":2,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMMUNICATION","Score":null,"Total":0}
引用次数: 1
Abstract
Problem:
This tutorial aims to guide readers through key concepts, basic processes, and common decision points that inform computer-assisted corpus-based research in technical, professional, and scientific communication (TPSC).
Key concepts:
Based on our collaborative experiences and an example developed for this tutorial, key concepts of corpus analysis useful to TPSC researchers and practitioners include the following: corpus location, text preparation, and programming language and software selection.
Key lessons:
These key concepts can be used to establish basic processes and decision points that, in turn, yield lessons related to the usefulness of lexicogrammatical language models and the significance of multidisciplinarity.
Implications:
Although corpus research is a growing and important part of the field of TPSC, challenges remain in terms of language model variety and ethical considerations. At least in part, these challenges can be met, respectively, by alignment between corpus and analytic tools and reference to the Common Rule and related international standards.
期刊介绍:
The IEEE Transactions on Professional Communication is a peer-reviewed journal devoted to applied research on professional communication—including but not limited to technical and business communication. Papers should address the research interests and needs of technical communicators, engineers, scientists, information designers, editors, linguists, translators, managers, business professionals, and others from around the globe who practice, conduct research on, and teach others about effective professional communication. The Transactions publishes original, empirical research that addresses one of these contexts: The communication practices of technical professionals, such as engineers and scientists The practices of professional communicators who work in technical or business environments Evidence-based methods for teaching and practicing professional and technical communication.