Anthony-Paul Cooper, Dr. Emmanuel Kolog Awuni, E. Sutinen
{"title":"Exploring the Use of Machine Learning to Automate the Qualitative Coding of Church-related Tweets","authors":"Anthony-Paul Cooper, Dr. Emmanuel Kolog Awuni, E. Sutinen","doi":"10.1558/firn.40610","DOIUrl":null,"url":null,"abstract":"This article builds on previous research around the exploration of the content of church-related tweets. It does so by exploring whether the qualitative thematic coding of such tweets can, in part, be automated by the use of machine learning. It compares three supervised machine learning algorithms to understand how useful each algorithm is at a classification task, based on a dataset of human-coded church-related tweets. The study finds that one such algorithm, Naïve-Bayes, performs better than the other algorithms considered, returning Precision, Recall and F-measure values which each exceed an acceptable threshold of 70%. This has far-reaching consequences at a time where the high volume of social media data, in this case, Twitter data, means that the resource-intensity of manual coding approaches can act as a barrier to understanding how the online community interacts with, and talks about, church. The findings presented in this article offer a way forward for scholars of digital theology to better understand the content of online church discourse.","PeriodicalId":41468,"journal":{"name":"Fieldwork in Religion","volume":" ","pages":""},"PeriodicalIF":0.4000,"publicationDate":"2020-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Fieldwork in Religion","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1558/firn.40610","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"RELIGION","Score":null,"Total":0}
引用次数: 4
Abstract
This article builds on previous research around the exploration of the content of church-related tweets. It does so by exploring whether the qualitative thematic coding of such tweets can, in part, be automated by the use of machine learning. It compares three supervised machine learning algorithms to understand how useful each algorithm is at a classification task, based on a dataset of human-coded church-related tweets. The study finds that one such algorithm, Naïve-Bayes, performs better than the other algorithms considered, returning Precision, Recall and F-measure values which each exceed an acceptable threshold of 70%. This has far-reaching consequences at a time where the high volume of social media data, in this case, Twitter data, means that the resource-intensity of manual coding approaches can act as a barrier to understanding how the online community interacts with, and talks about, church. The findings presented in this article offer a way forward for scholars of digital theology to better understand the content of online church discourse.
期刊介绍:
Fieldwork in Religion (FIR) is a peer reviewed, interdisciplinary journal seeking engagement between scholars carrying out empirical research in religion. It will consider articles from established scholars and research students. The purpose of Fieldwork in Religion is to promote critical investigation into all aspects of the empirical study of contemporary religion. The journal is interdisciplinary in that it is not limited to the fields of anthropology and ethnography. Fieldwork in Religion seeks to promote empirical study of religion in all disciplines: religious studies, anthropology, ethnography, sociology, psychology, folklore, or cultural studies. A further important aim of Fieldwork in Religion is to encourage the discussion of methodology in fieldwork either through discrete articles on issues of methodology or by publishing fieldwork case studies that include methodological challenges and the impact of methodology on the results of empirical research.