Anonymity Effects: A Large-Scale Dataset from an Anonymous Social Media Platform

Mainack Mondal, D. Correa, Fabrício Benevenuto
{"title":"Anonymity Effects: A Large-Scale Dataset from an Anonymous Social Media Platform","authors":"Mainack Mondal, D. Correa, Fabrício Benevenuto","doi":"10.1145/3372923.3404792","DOIUrl":null,"url":null,"abstract":"Today online social media sites function as the medium of expression for billions of users. As a result, aside from conventional social media sites like Facebook and Twitter, platform designers introduced many alternative social media platforms (e.g., 4chan, Whisper, Snapchat, Mastodon) to serve specific userbases. Among these platforms, anonymous social media sites like Whisper and 4chan hold a special place for researchers. Unlike conventional social media sites, posts on anonymous social media sites are not associated with persistent user identities or profiles. Thus, these anonymous social media sites can provide an extremely interesting data-driven lens into the effects of anonymity on online user behavior. However, to the best of our knowledge, currently there are no publicly available datasets to facilitate research efforts on these anonymity effects. To that end, in this paper, we aim to publicly release the first ever large-scale dataset from Whisper, a large anonymous online social media platform. Specifically, our dataset contains 89.8 Million Whisper posts (called \"whispers'') published between a 2-year period from June 6, 2014 to June 6, 2016 (when Whisper was quite popular). Each of these whispers contained both post text and associated metadata. The metadata contains information like coarse-grained location of upload and categories of whispers. We also present preliminary descriptive statistics to demonstrate a significant language and categorical diversity in our dataset. We leverage previous work as well as novel analysis to demonstrate that the whispers contain personal emotions and opinions (likely facilitated by a disinhibition complex due to anonymity). Consequently, we envision that our dataset will facilitate novel research ranging from understanding online aggression to detect depression within online populace.","PeriodicalId":389616,"journal":{"name":"Proceedings of the 31st ACM Conference on Hypertext and Social Media","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 31st ACM Conference on Hypertext and Social Media","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3372923.3404792","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Today online social media sites function as the medium of expression for billions of users. As a result, aside from conventional social media sites like Facebook and Twitter, platform designers introduced many alternative social media platforms (e.g., 4chan, Whisper, Snapchat, Mastodon) to serve specific userbases. Among these platforms, anonymous social media sites like Whisper and 4chan hold a special place for researchers. Unlike conventional social media sites, posts on anonymous social media sites are not associated with persistent user identities or profiles. Thus, these anonymous social media sites can provide an extremely interesting data-driven lens into the effects of anonymity on online user behavior. However, to the best of our knowledge, currently there are no publicly available datasets to facilitate research efforts on these anonymity effects. To that end, in this paper, we aim to publicly release the first ever large-scale dataset from Whisper, a large anonymous online social media platform. Specifically, our dataset contains 89.8 Million Whisper posts (called "whispers'') published between a 2-year period from June 6, 2014 to June 6, 2016 (when Whisper was quite popular). Each of these whispers contained both post text and associated metadata. The metadata contains information like coarse-grained location of upload and categories of whispers. We also present preliminary descriptive statistics to demonstrate a significant language and categorical diversity in our dataset. We leverage previous work as well as novel analysis to demonstrate that the whispers contain personal emotions and opinions (likely facilitated by a disinhibition complex due to anonymity). Consequently, we envision that our dataset will facilitate novel research ranging from understanding online aggression to detect depression within online populace.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
匿名效应:来自匿名社交媒体平台的大规模数据集
今天,在线社交媒体网站是数十亿用户的表达媒介。因此,除了像Facebook和Twitter这样的传统社交媒体网站外,平台设计师还引入了许多替代社交媒体平台(如4chan, Whisper, Snapchat, Mastodon)来服务特定的用户群。在这些平台中,Whisper和4chan等匿名社交媒体网站为研究人员提供了一个特殊的位置。与传统的社交媒体网站不同,匿名社交媒体网站上的帖子与持久的用户身份或个人资料无关。因此,这些匿名的社交媒体网站可以为匿名对在线用户行为的影响提供一个非常有趣的数据驱动镜头。然而,据我们所知,目前还没有公开可用的数据集来促进对这些匿名效应的研究。为此,在本文中,我们的目标是公开发布来自大型匿名在线社交媒体平台Whisper的第一个大规模数据集。具体来说,我们的数据集包含了2014年6月6日至2016年6月6日(当时Whisper非常受欢迎)这两年期间发布的8980万个Whisper帖子(称为“whispers”)。每个耳语都包含帖子文本和相关的元数据。元数据包含诸如上传的粗粒度位置和耳语的类别之类的信息。我们还提出了初步的描述性统计,以证明在我们的数据集中有显著的语言和分类多样性。我们利用以前的工作以及新的分析来证明耳语包含个人情绪和观点(可能是由于匿名而产生的去抑制情结)。因此,我们设想我们的数据集将促进新的研究,从理解在线攻击到检测在线人群中的抑郁。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Date the Artist: A Virtual Date with a Virtual Character 3rd Workshop on Human Factors in Hypertext (HUMAN'20) Noise-Enhanced Community Detection You Do Not Decide for Me! Evaluating Explainable Group Aggregation Strategies for Tourism Personalizing Information Exploration with an Open User Model
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1