VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination on Twitter

Yida Mu, Mali Jin, Charles Grimshaw, Carolina Scarton, Kalina Bontcheva, Xingyi Song
{"title":"VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination on Twitter","authors":"Yida Mu, Mali Jin, Charles Grimshaw, Carolina Scarton, Kalina Bontcheva, Xingyi Song","doi":"10.48550/arXiv.2301.06660","DOIUrl":null,"url":null,"abstract":"Vaccine hesitancy has been a common concern, probably since vaccines were created and, with the popularisation of social media, people started to express their concerns about vaccines online alongside those posting pro- and anti-vaccine content. Predictably, since the first mentions of a COVID-19 vaccine, social media users posted about their fears and concerns or about their support and belief into the effectiveness of these rapidly developing vaccines. Identifying and understanding the reasons behind public hesitancy towards COVID-19 vaccines is important for policy markers that need to develop actions to better inform the population with the aim of increasing vaccine take-up. In the case of COVID-19, where the fast development of the vaccines was mirrored closely by growth in anti-vaxx disinformation, automatic means of detecting citizen attitudes towards vaccination became necessary. This is an important computational social sciences task that requires data analysis in order to gain in-depth understanding of the phenomena at hand. Annotated data is also necessary for training data-driven models for more nuanced analysis of attitudes towards vaccination. To this end, we created a new collection of over 3,101 tweets annotated with users' attitudes towards COVID-19 vaccination (stance). Besides, we also develop a domain-specific language model (VaxxBERT) that achieves the best predictive performance (73.0 accuracy and 69.3 F1-score) as compared to a robust set of baselines. To the best of our knowledge, these are the first dataset and model that model vaccine hesitancy as a category distinct from pro- and anti-vaccine stance.","PeriodicalId":175641,"journal":{"name":"International Conference on Web and Social Media","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Web and Social Media","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2301.06660","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

Vaccine hesitancy has been a common concern, probably since vaccines were created and, with the popularisation of social media, people started to express their concerns about vaccines online alongside those posting pro- and anti-vaccine content. Predictably, since the first mentions of a COVID-19 vaccine, social media users posted about their fears and concerns or about their support and belief into the effectiveness of these rapidly developing vaccines. Identifying and understanding the reasons behind public hesitancy towards COVID-19 vaccines is important for policy markers that need to develop actions to better inform the population with the aim of increasing vaccine take-up. In the case of COVID-19, where the fast development of the vaccines was mirrored closely by growth in anti-vaxx disinformation, automatic means of detecting citizen attitudes towards vaccination became necessary. This is an important computational social sciences task that requires data analysis in order to gain in-depth understanding of the phenomena at hand. Annotated data is also necessary for training data-driven models for more nuanced analysis of attitudes towards vaccination. To this end, we created a new collection of over 3,101 tweets annotated with users' attitudes towards COVID-19 vaccination (stance). Besides, we also develop a domain-specific language model (VaxxBERT) that achieves the best predictive performance (73.0 accuracy and 69.3 F1-score) as compared to a robust set of baselines. To the best of our knowledge, these are the first dataset and model that model vaccine hesitancy as a category distinct from pro- and anti-vaccine stance.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
vaxx犹豫不决:一个研究推特上对COVID-19疫苗接种犹豫不决的数据集
疫苗犹豫一直是一个普遍的问题,可能自从疫苗发明以来,随着社交媒体的普及,人们开始在网上表达他们对疫苗的担忧,同时发布支持和反对疫苗的内容。可以预见的是,自从第一次提到COVID-19疫苗以来,社交媒体用户发布了他们的恐惧和担忧,或者他们对这些快速发展的疫苗的有效性的支持和信念。确定和了解公众对COVID-19疫苗犹豫不决背后的原因,对于需要制定行动以更好地告知民众以提高疫苗接种率的政策制定者来说非常重要。在COVID-19的情况下,疫苗的快速发展与反vaxx虚假信息的增长密切相关,因此有必要采用自动手段检测公民对疫苗接种的态度。这是一项重要的计算社会科学任务,需要数据分析,以便深入了解手头的现象。带注释的数据对于训练数据驱动的模型也是必要的,以便更细致地分析对疫苗接种的态度。为此,我们创建了一个包含3101多条推文的新集合,其中标注了用户对COVID-19疫苗接种的态度(立场)。此外,我们还开发了一个领域特定的语言模型(VaxxBERT),与一组稳健的基线相比,该模型实现了最佳的预测性能(准确率为73.0,f1得分为69.3)。据我们所知,这是第一个将疫苗犹豫作为不同于支持和反对疫苗立场的类别进行建模的数据集和模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
RTANet: Recommendation Target-Aware Network Embedding Who Is behind a Trend? Temporal Analysis of Interactions among Trend Participants on Twitter Host-Centric Social Connectedness of Migrants in Europe on Facebook Recipe Networks and the Principles of Healthy Food on the Web Social Influence-Maximizing Group Recommendation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1