{"title":"From corpus creation to formative discovery: the power of big-data-rhetoric teams and methods","authors":"Sarah E. Ryan, Lingzi Hong, Mohotarema Rashid","doi":"10.1080/15358593.2022.2119094","DOIUrl":null,"url":null,"abstract":"ABSTRACT Rhetoric has been slow to adopt big-data techniques, but that is changing. In this article, we describe the formative work of our rhetoric-data science team on an ideographic analysis of state veteran laws. Our interdisciplinary approach enabled us to build a corpus of more than 7,000 files, segment that corpus into likely public and private laws, and develop dictionaries for discerning individual entitlements, such as waived fees for gun permits. Early results show state-level trends in the number of veteran laws, proportion of veteran laws concerning disabled veterans, and proportion of veteran/disability laws affording individual entitlements. While this article presents early findings, its broader purpose is to contribute to discussions of corpus building, data cleaning, formative analysis, and the value of big-data-rhetoric collaborations. Our experience provides five insights: (1) big-data collection methods can save a public rhetoric project when customary retrieval methods fail; (2) big-data-rhetoric work starts conceptually and becomes concretized; (3) formative big-data rhetoric work can problematize fundamental research assumptions, such as what should be included in a corpus; (4) big-data methods can produce interesting results early, yielding a roadmap for future work; and (5) big-data-rhetoric teams need more guidance from the field.","PeriodicalId":53587,"journal":{"name":"Review of Communication","volume":"85 22","pages":"38 - 61"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Review of Communication","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/15358593.2022.2119094","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Social Sciences","Score":null,"Total":0}
引用次数: 3
Abstract
ABSTRACT Rhetoric has been slow to adopt big-data techniques, but that is changing. In this article, we describe the formative work of our rhetoric-data science team on an ideographic analysis of state veteran laws. Our interdisciplinary approach enabled us to build a corpus of more than 7,000 files, segment that corpus into likely public and private laws, and develop dictionaries for discerning individual entitlements, such as waived fees for gun permits. Early results show state-level trends in the number of veteran laws, proportion of veteran laws concerning disabled veterans, and proportion of veteran/disability laws affording individual entitlements. While this article presents early findings, its broader purpose is to contribute to discussions of corpus building, data cleaning, formative analysis, and the value of big-data-rhetoric collaborations. Our experience provides five insights: (1) big-data collection methods can save a public rhetoric project when customary retrieval methods fail; (2) big-data-rhetoric work starts conceptually and becomes concretized; (3) formative big-data rhetoric work can problematize fundamental research assumptions, such as what should be included in a corpus; (4) big-data methods can produce interesting results early, yielding a roadmap for future work; and (5) big-data-rhetoric teams need more guidance from the field.