首页 > 最新文献

Proceedings of the International AAAI Conference on Web and Social Media最新文献

英文 中文
Statement of Removal 搬迁声明
Pub Date : 2024-03-18 DOI: 10.1609/icwsm.v13i01.22003
Matteo Zignani, C. Quadri, Alessia Galdeman, S. Gaito, G. P. Rossi
This Statement of Removal refers to:  Mastodon Content Warnings: Inappropriate Contents in a Microblogging Platform
本移除声明涉及 Mastodon 内容警告:微博平台中的不当内容
{"title":"Statement of Removal","authors":"Matteo Zignani, C. Quadri, Alessia Galdeman, S. Gaito, G. P. Rossi","doi":"10.1609/icwsm.v13i01.22003","DOIUrl":"https://doi.org/10.1609/icwsm.v13i01.22003","url":null,"abstract":"This Statement of Removal refers to:  \u0000Mastodon Content Warnings: Inappropriate Contents in a Microblogging Platform","PeriodicalId":338112,"journal":{"name":"Proceedings of the International AAAI Conference on Web and Social Media","volume":"311 5","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140232999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
"A Special Operation": A Quantitative Approach to Dissecting and Comparing Different Media Ecosystems’ Coverage of the Russo-Ukrainian War “一次特殊行动”:分析和比较不同媒体生态系统对俄乌战争报道的定量方法
Pub Date : 2023-06-02 DOI: 10.1609/icwsm.v17i1.22150
Hans W. A. Hanley, Deepak Kumar, Zakir Durumeric
The coverage of the Russian invasion of Ukraine has varied widely between Western, Russian, and Chinese media ecosystems with propaganda, disinformation, and narrative spins present in all three. By utilizing the normalized pointwise mutual information metric, differential sentiment analysis, word2vec models, and partially labeled Dirichlet allocation, we present a quantitative analysis of the differences in coverage amongst these three news ecosystems. We find that while the Western press outlets have focused on the military and humanitarian aspects of the war, Russian media have focused on the purported justifications for the “special military operation” such as the presence in Ukraine of “bio-weapons” and “neo-nazis”, and Chinese news media have concentrated on the conflict’s diplomatic and economic consequences. Detecting the presence of several Russian disinformation narratives in the articles of several Chinese media outlets, we finally measure the degree to which Russian media has influenced Chinese coverage across Chinese outlets’ news articles, Weibo accounts, and Twitter accounts. Our analysis indicates that since the Russian invasion of Ukraine, Chinese state media outlets have increasingly cited Russian outlets as news sources and spread Russian disinformation narratives.
俄罗斯入侵乌克兰的报道在西方、俄罗斯和中国媒体生态系统之间存在很大差异,其中包括宣传、虚假信息和叙事旋转。利用归一化的点互信息度量、差分情感分析、word2vec模型和部分标记的Dirichlet分配,我们对这三个新闻生态系统之间的覆盖差异进行了定量分析。我们发现,当西方媒体关注战争的军事和人道主义方面时,俄罗斯媒体关注的是所谓的“特殊军事行动”的理由,比如在乌克兰存在“生物武器”和“新纳粹分子”,而中国新闻媒体关注的是冲突的外交和经济后果。在几家中国媒体的文章中发现了一些俄罗斯虚假信息叙事的存在,我们最终衡量了俄罗斯媒体在中国媒体的新闻文章、微博账户和推特账户中影响中国报道的程度。我们的分析表明,自俄罗斯入侵乌克兰以来,中国官方媒体越来越多地引用俄罗斯媒体作为新闻来源,并传播俄罗斯的虚假信息叙事。
{"title":"\"A Special Operation\": A Quantitative Approach to Dissecting and Comparing Different Media Ecosystems’ Coverage of the Russo-Ukrainian War","authors":"Hans W. A. Hanley, Deepak Kumar, Zakir Durumeric","doi":"10.1609/icwsm.v17i1.22150","DOIUrl":"https://doi.org/10.1609/icwsm.v17i1.22150","url":null,"abstract":"The coverage of the Russian invasion of Ukraine has varied widely between Western, Russian, and Chinese media ecosystems with propaganda, disinformation, and narrative spins present in all three. By utilizing the normalized pointwise mutual information metric, differential sentiment analysis, word2vec models, and partially labeled Dirichlet allocation, we present a quantitative analysis of the differences in coverage amongst these three news ecosystems. We find that while the Western press outlets have focused on the military and humanitarian aspects of the war, Russian media have focused on the purported justifications for the “special military operation” such as the presence in Ukraine of “bio-weapons” and “neo-nazis”, and Chinese news media have concentrated on the conflict’s diplomatic and economic consequences. Detecting the presence of several Russian disinformation narratives in the articles of several Chinese media outlets, we finally measure the degree to which Russian media has influenced Chinese coverage across Chinese outlets’ news articles, Weibo accounts, and Twitter accounts. Our analysis indicates that since the Russian invasion of Ukraine, Chinese state media outlets have increasingly cited Russian outlets as news sources and spread Russian disinformation narratives.","PeriodicalId":338112,"journal":{"name":"Proceedings of the International AAAI Conference on Web and Social Media","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135910221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Quotatives Indicate Decline in Objectivity in U.S. Political News 引用表明美国政治新闻客观性的下降
Pub Date : 2023-06-02 DOI: 10.1609/icwsm.v17i1.22152
Tiancheng Hu, Manoel Horta Ribeiro, Robert West, Andreas Spitz
According to journalistic standards, direct quotes should be attributed to sources with objective quotatives such as ``said'' and ``told,'' since nonobjective quotatives, e.g., ``argued'' and ``insisted,'' would influence the readers' perception of the quote and the quoted person. In this paper, we analyze the adherence to this journalistic norm to study trends in objectivity in political news across U.S. outlets of different ideological leanings. We ask: 1) How has the usage of nonobjective quotatives evolved? 2) How do news outlets use nonobjective quotatives when covering politicians of different parties? To answer these questions, we developed a dependency-parsing-based method to extract quotatives and applied it to Quotebank, a web-scale corpus of attributed quotes, obtaining nearly 7 million quotes, each enriched with the quoted speaker's political party and the ideological leaning of the outlet that published the quote. We find that, while partisan outlets are the ones that most often use nonobjective quotatives, between 2013 and 2020, the outlets that increased their usage of nonobjective quotatives the most were ``moderate'' centrist news outlets (around 0.6 percentage points, or 20% in relative percentage over seven years). Further, we find that outlets use nonobjective quotatives more often when quoting politicians of the opposing ideology (e.g., left-leaning outlets quoting Republicans) and that this ``quotative bias'' is rising at a swift pace, increasing up to 0.5 percentage points, or 25% in relative percentage, per year. These findings suggest an overall decline in journalistic objectivity in U.S. political news.
根据新闻标准,直接引用应该归功于有客观引号的来源,如“said”和“told”,因为非客观引号,如“argue”和“insist”,会影响读者对引用和被引用人的看法。在本文中,我们分析了对这一新闻规范的遵守情况,以研究不同意识形态倾向的美国媒体在政治新闻客观性方面的趋势。我们的问题是:1)非客观引语的用法是如何演变的?2)新闻媒体在报道不同政党的政治家时,如何使用非客观的引述?为了回答这些问题,我们开发了一种基于依赖解析的引语提取方法,并将其应用于Quotebank(一个网络规模的引语语料库),获得了近700万条引语,每条引语都富含被引者的政党和发表引语的媒体的意识形态倾向。我们发现,虽然党派媒体是最常使用非客观引用的媒体,但在2013年至2020年期间,使用非客观引用最多的媒体是“温和”的中间派新闻媒体(约0.6个百分点,或七年的相对百分比为20%)。此外,我们发现媒体在引用反对意识形态的政治家时(例如,左倾媒体引用共和党人)更频繁地使用非客观引用,而且这种“引用偏见”正在迅速上升,每年增加0.5个百分点,或相对百分比增加25%。这些发现表明,美国政治新闻的新闻客观性总体上有所下降。
{"title":"Quotatives Indicate Decline in Objectivity in U.S. Political News","authors":"Tiancheng Hu, Manoel Horta Ribeiro, Robert West, Andreas Spitz","doi":"10.1609/icwsm.v17i1.22152","DOIUrl":"https://doi.org/10.1609/icwsm.v17i1.22152","url":null,"abstract":"According to journalistic standards, direct quotes should be attributed to sources with objective quotatives such as ``said'' and ``told,'' since nonobjective quotatives, e.g., ``argued'' and ``insisted,'' would influence the readers' perception of the quote and the quoted person. In this paper, we analyze the adherence to this journalistic norm to study trends in objectivity in political news across U.S. outlets of different ideological leanings. We ask: 1) How has the usage of nonobjective quotatives evolved? 2) How do news outlets use nonobjective quotatives when covering politicians of different parties? To answer these questions, we developed a dependency-parsing-based method to extract quotatives and applied it to Quotebank, a web-scale corpus of attributed quotes, obtaining nearly 7 million quotes, each enriched with the quoted speaker's political party and the ideological leaning of the outlet that published the quote. We find that, while partisan outlets are the ones that most often use nonobjective quotatives, between 2013 and 2020, the outlets that increased their usage of nonobjective quotatives the most were ``moderate'' centrist news outlets (around 0.6 percentage points, or 20% in relative percentage over seven years). Further, we find that outlets use nonobjective quotatives more often when quoting politicians of the opposing ideology (e.g., left-leaning outlets quoting Republicans) and that this ``quotative bias'' is rising at a swift pace, increasing up to 0.5 percentage points, or 25% in relative percentage, per year. These findings suggest an overall decline in journalistic objectivity in U.S. political news.","PeriodicalId":338112,"journal":{"name":"Proceedings of the International AAAI Conference on Web and Social Media","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136040991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Conversation Modeling to Predict Derailment 会话建模预测脱轨
Pub Date : 2023-06-02 DOI: 10.1609/icwsm.v17i1.22200
Jiaqing Yuan, Munindar P. Singh
Conversations among online users sometimes derail, i.e., break down into personal attacks. Derailment interferes with the healthy growth of communities in cyberspace. The ability to predict whether an ongoing conversation will derail could provide valuable advance, even real-time, insight to both interlocutors and moderators. Prior approaches predict conversation derailment retrospectively without the ability to forestall the derailment proactively. Some existing works attempt to make dynamic predictions as the conversation develops, but fail to incorporate multisource information, such as conversational structure and distance to derailment. We propose a hierarchical transformer-based framework that combines utterance-level and conversation-level information to capture fine-grained contextual semantics. We propose a domain-adaptive pretraining objective to unite conversational structure information and a multitask learning scheme to leverage the distance from each utterance to derailment. An evaluation of our framework on two conversation derailment datasets shows an improvement in F1 score for the prediction of derailment. These results demonstrate the effectiveness of incorporating multisource information for predicting the derailment of a conversation.
在线用户之间的对话有时会脱轨,也就是说,会演变成人身攻击。出轨会干扰网络空间社区的健康发展。预测正在进行的对话是否会脱轨的能力可以为对话者和主持人提供有价值的进步,甚至是实时的洞察力。先前的方法是回顾性地预测谈话的脱轨,而没有能力预先阻止脱轨。一些现有的研究试图对对话的发展进行动态预测,但未能纳入多源信息,如对话结构和离出轨的距离。我们提出了一个基于层次转换器的框架,该框架结合了话语级和会话级信息来捕获细粒度的上下文语义。我们提出了一个领域自适应的预训练目标来统一会话结构信息,并提出了一个多任务学习方案来利用从每个话语到脱轨的距离。对我们的框架在两个会话脱轨数据集上的评估显示,脱轨预测的F1分数有所提高。这些结果证明了结合多源信息预测会话脱轨的有效性。
{"title":"Conversation Modeling to Predict Derailment","authors":"Jiaqing Yuan, Munindar P. Singh","doi":"10.1609/icwsm.v17i1.22200","DOIUrl":"https://doi.org/10.1609/icwsm.v17i1.22200","url":null,"abstract":"Conversations among online users sometimes derail, i.e., break down into personal attacks. Derailment interferes with the healthy growth of communities in cyberspace. The ability to predict whether an ongoing conversation will derail could provide valuable advance, even real-time, insight to both interlocutors and moderators. Prior approaches predict conversation derailment retrospectively without the ability to forestall the derailment proactively. Some existing works attempt to make dynamic predictions as the conversation develops, but fail to incorporate multisource information, such as conversational structure and distance to derailment. We propose a hierarchical transformer-based framework that combines utterance-level and conversation-level information to capture fine-grained contextual semantics. We propose a domain-adaptive pretraining objective to unite conversational structure information and a multitask learning scheme to leverage the distance from each utterance to derailment. An evaluation of our framework on two conversation derailment datasets shows an improvement in F1 score for the prediction of derailment. These results demonstrate the effectiveness of incorporating multisource information for predicting the derailment of a conversation.","PeriodicalId":338112,"journal":{"name":"Proceedings of the International AAAI Conference on Web and Social Media","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136041107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
"Dummy Grandpa, Do You Know Anything?": Identifying and Characterizing Ad Hominem Fallacy Usage in the Wild “笨蛋爷爷,你知道什么吗?”识别和表征在野外使用人身攻击谬误
Pub Date : 2023-06-02 DOI: 10.1609/icwsm.v17i1.22180
Utkarsh Patel, Animesh Mukherjee, Mainack Mondal
Today, participating in discussions on online forums is extremely commonplace and these discussions have started rendering a strong influence on the overall opinion of online users. Naturally, twisting the flow of the argument can have a strong impact on the minds of naive users, which in the long run might have socio-political ramifications, for example, winning an election or spreading targeted misinformation. Thus, these platforms are potentially highly vulnerable to malicious players who might act individually or as a cohort to breed fallacious arguments with a motive to sway public opinion. Ad hominem arguments are one of the most effective forms of such fallacies. Although a simple fallacy, it is effective enough to sway public debates in offline world and can be used as a precursor to shutting down the voice of opposition by slander. In this work, we take a first step in shedding light on the usage of ad hominem fallacies in the wild. First, we build a powerful ad hominem detector based on transformer architecture with high accuracy (F1 more than 83%, showing a significant improvement over prior work), even for datasets for which annotated instances constitute a very small fraction. We then used our detector on 265k arguments collected from the online debate forum – CreateDebate. Our crowdsourced surveys validate our in-the-wild predictions on CreateDebate data (94% match with manual annotation). Our analysis revealed that a surprising 31.23% of CreateDebate content contains ad hominem fallacy, and a cohort of highly active users post significantly more ad hominem to suppress opposing views. Then, our temporal analysis revealed that ad hominem argument usage increased significantly since the 2016 US Presidential election, not only for topics like Politics, but also for Science and Law. We conclude by discussing important implications of our work to detect and defend against ad hominem fallacies.
今天,参与网络论坛的讨论是非常普遍的,这些讨论已经开始对网络用户的整体意见产生强烈的影响。当然,扭曲论点的流向会对天真用户的思想产生强烈影响,从长远来看,这可能会产生社会政治后果,例如,赢得选举或传播有针对性的错误信息。因此,这些平台可能极易受到恶意玩家的攻击,他们可能单独行动,或作为一个群体,以影响公众舆论的动机滋生错误的论点。人身攻击论证是这种谬论最有效的形式之一。虽然这是一个简单的谬论,但它足以影响离线世界的公共辩论,并可以作为通过诽谤来关闭反对声音的前兆。在这项工作中,我们迈出了第一步,阐明了在野外使用人身攻击谬论。首先,我们构建了一个基于变压器架构的功能强大的人身攻击检测器,具有很高的准确性(F1超过83%,比以前的工作有了显著的改进),即使对于带有注释的实例只占很小一部分的数据集也是如此。然后,我们对在线辩论论坛CreateDebate收集的265k个论点使用检测器。我们的众包调查验证了我们对CreateDebate数据的预测(94%与手动注释相符)。我们的分析显示,令人惊讶的是,有31.23%的CreateDebate内容包含人身攻击谬误,而且一群高度活跃的用户发布了更多的人身攻击来压制反对意见。然后,我们的时间分析显示,自2016年美国总统大选以来,人身攻击论点的使用显著增加,不仅适用于政治等话题,也适用于科学和法律。最后,我们讨论了我们的工作对检测和防御人身攻击谬论的重要意义。
{"title":"\"Dummy Grandpa, Do You Know Anything?\": Identifying and Characterizing Ad Hominem Fallacy Usage in the Wild","authors":"Utkarsh Patel, Animesh Mukherjee, Mainack Mondal","doi":"10.1609/icwsm.v17i1.22180","DOIUrl":"https://doi.org/10.1609/icwsm.v17i1.22180","url":null,"abstract":"Today, participating in discussions on online forums is extremely commonplace and these discussions have started rendering a strong influence on the overall opinion of online users. Naturally, twisting the flow of the argument can have a strong impact on the minds of naive users, which in the long run might have socio-political ramifications, for example, winning an election or spreading targeted misinformation. Thus, these platforms are potentially highly vulnerable to malicious players who might act individually or as a cohort to breed fallacious arguments with a motive to sway public opinion. Ad hominem arguments are one of the most effective forms of such fallacies. Although a simple fallacy, it is effective enough to sway public debates in offline world and can be used as a precursor to shutting down the voice of opposition by slander. In this work, we take a first step in shedding light on the usage of ad hominem fallacies in the wild. First, we build a powerful ad hominem detector based on transformer architecture with high accuracy (F1 more than 83%, showing a significant improvement over prior work), even for datasets for which annotated instances constitute a very small fraction. We then used our detector on 265k arguments collected from the online debate forum – CreateDebate. Our crowdsourced surveys validate our in-the-wild predictions on CreateDebate data (94% match with manual annotation). Our analysis revealed that a surprising 31.23% of CreateDebate content contains ad hominem fallacy, and a cohort of highly active users post significantly more ad hominem to suppress opposing views. Then, our temporal analysis revealed that ad hominem argument usage increased significantly since the 2016 US Presidential election, not only for topics like Politics, but also for Science and Law. We conclude by discussing important implications of our work to detect and defend against ad hominem fallacies.","PeriodicalId":338112,"journal":{"name":"Proceedings of the International AAAI Conference on Web and Social Media","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136041101","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
What Are You Anxious About? Examining Subjects of Anxiety during the COVID-19 Pandemic 你在焦虑什么?COVID-19大流行期间焦虑受试者的研究
Pub Date : 2023-06-02 DOI: 10.1609/icwsm.v17i1.22133
Lucia L. Chen, Steven R. Wilson, Sophie Lohmann, Daniela V. Negraia
COVID-19 poses disproportionate mental health consequences to the public during different phases of the pandemic. We use a computational approach to capture the specific aspects that trigger the public's anxiety about the pandemic and investigate how these aspects change over time. First, we identified nine subjects of anxiety (SOAs) in a sample of Reddit posts (N=86) from r/COVID19_support using the thematic analysis approach. Then, we quantified Reddit users' anxiety by training algorithms on a manually annotated sample (N=793) to annotate the SOAs in a larger chronological sample (N=6,535). The nine SOAs align with items in various recently developed pandemic anxiety measurement scales. We observed that Reddit users' concerns about health risks remained high in the first eight months since the pandemic started. These concerns diminished dramatically despite the surge of cases occurring later. In general, users' language disclosing the SOAs became less intense as the pandemic progressed. However, worries about mental health and the future steadily increased throughout the period covered in this study. People also tended to use more intense language to describe mental health concerns than health risk or death concerns. Our results suggest that the public's mental health condition does not necessarily improve despite COVID-19 as a health threat gradually weakening due to appropriate countermeasures. Our system lays the groundwork for population health and epidemiology scholars to examine aspects that provoke pandemic anxiety in a timely fashion.
COVID-19在大流行的不同阶段对公众造成了不成比例的心理健康后果。我们使用计算方法捕捉引发公众对大流行焦虑的具体方面,并调查这些方面如何随时间变化。首先,我们使用主题分析方法从r/COVID19_support的Reddit帖子样本(N=86)中确定了9个焦虑主题(soa)。然后,我们通过在手动注释的样本(N=793)上训练算法来量化Reddit用户的焦虑,以便在更大的按时间顺序排列的样本(N=6,535)中注释soa。9个soa与最近开发的各种大流行焦虑测量量表中的项目一致。我们观察到,自大流行开始以来的前八个月,Reddit用户对健康风险的担忧仍然很高。尽管后来发生的病例激增,但这些担忧大大减少。一般来说,随着疫情的发展,用户披露soa的语言变得不那么激烈了。然而,在研究期间,对心理健康和未来的担忧稳步增加。人们还倾向于使用更激烈的语言来描述心理健康问题,而不是健康风险或死亡问题。我们的研究结果表明,尽管COVID-19作为健康威胁逐渐减弱,但由于适当的对策,公众的心理健康状况并不一定会改善。我们的系统为人口健康和流行病学学者及时研究引发大流行焦虑的方面奠定了基础。
{"title":"What Are You Anxious About? Examining Subjects of Anxiety during the COVID-19 Pandemic","authors":"Lucia L. Chen, Steven R. Wilson, Sophie Lohmann, Daniela V. Negraia","doi":"10.1609/icwsm.v17i1.22133","DOIUrl":"https://doi.org/10.1609/icwsm.v17i1.22133","url":null,"abstract":"COVID-19 poses disproportionate mental health consequences to the public during different phases of the pandemic. We use a computational approach to capture the specific aspects that trigger the public's anxiety about the pandemic and investigate how these aspects change over time. First, we identified nine subjects of anxiety (SOAs) in a sample of Reddit posts (N=86) from r/COVID19_support using the thematic analysis approach. Then, we quantified Reddit users' anxiety by training algorithms on a manually annotated sample (N=793) to annotate the SOAs in a larger chronological sample (N=6,535). The nine SOAs align with items in various recently developed pandemic anxiety measurement scales. We observed that Reddit users' concerns about health risks remained high in the first eight months since the pandemic started. These concerns diminished dramatically despite the surge of cases occurring later. In general, users' language disclosing the SOAs became less intense as the pandemic progressed. However, worries about mental health and the future steadily increased throughout the period covered in this study. People also tended to use more intense language to describe mental health concerns than health risk or death concerns. Our results suggest that the public's mental health condition does not necessarily improve despite COVID-19 as a health threat gradually weakening due to appropriate countermeasures. Our system lays the groundwork for population health and epidemiology scholars to examine aspects that provoke pandemic anxiety in a timely fashion.","PeriodicalId":338112,"journal":{"name":"Proceedings of the International AAAI Conference on Web and Social Media","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136041103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Half-Life of a Tweet 推文的半衰期
Pub Date : 2023-06-02 DOI: 10.1609/icwsm.v17i1.22228
Jürgen Pfeffer, Daniel Matter, Anahit Sargsyan
Twitter has started to share an impression count variable as part of the available public metrics for every Tweet collected with Twitter’s APIs. With the information about how often a particular Tweet has been shown to Twitter users at the time of data collection, we can learn important insights about the dissemination process of a Tweet by measuring its impression count repeatedly over time. With our preliminary analysis, we can show that on average the peak of impressions per second is 72 seconds after a Tweet was sent and that after 24 hours, no relevant number of impressions can be observed for ∼95% of all Tweets. Finally, we estimate that the median half-life of a Tweet, i.e. the time it takes before half of all impressions are created, is about 80 minutes.
Twitter已经开始分享一个印象计数变量,作为Twitter api收集的每条Tweet的可用公共指标的一部分。在数据收集时,有了关于特定Tweet向Twitter用户展示的频率的信息,我们可以通过反复测量Tweet的印象数来了解Tweet传播过程的重要见解。通过我们的初步分析,我们可以表明,平均每秒展示次数的峰值是在Tweet发送后72秒,并且在24小时后,可以观察到95%的Tweet没有相关的展示次数。最后,我们估计一条Tweet的中间半衰期(游戏邦注:即创造一半印象所需的时间)约为80分钟。
{"title":"The Half-Life of a Tweet","authors":"Jürgen Pfeffer, Daniel Matter, Anahit Sargsyan","doi":"10.1609/icwsm.v17i1.22228","DOIUrl":"https://doi.org/10.1609/icwsm.v17i1.22228","url":null,"abstract":"Twitter has started to share an impression count variable as part of the available public metrics for every Tweet collected with Twitter’s APIs. With the information about how often a particular Tweet has been shown to Twitter users at the time of data collection, we can learn important insights about the dissemination process of a Tweet by measuring its impression count repeatedly over time. With our preliminary analysis, we can show that on average the peak of impressions per second is 72 seconds after a Tweet was sent and that after 24 hours, no relevant number of impressions can be observed for ∼95% of all Tweets. Finally, we estimate that the median half-life of a Tweet, i.e. the time it takes before half of all impressions are created, is about 80 minutes.","PeriodicalId":338112,"journal":{"name":"Proceedings of the International AAAI Conference on Web and Social Media","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136041110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Data Fusion Framework for Multi-Domain Morality Learning 面向多领域道德学习的数据融合框架
Pub Date : 2023-06-02 DOI: 10.1609/icwsm.v17i1.22145
Siyi Guo, Negar Mokhberian, Kristina Lerman
Language models can be trained to recognize the moral sentiment of text, creating new opportunities to study the role of morality in human life. As interest in language and morality has grown, several ground truth datasets with moral annotations have been released. However, these datasets vary in the method of data collection, domain, topics, instructions for annotators, etc. Simply aggregating such heterogeneous datasets during training can yield models that fail to generalize well. We describe a data fusion framework for training on multiple heterogeneous datasets that improve performance and generalizability. The model uses domain adversarial training to align the datasets in feature space and a weighted loss function to deal with label shift. We show that the proposed framework achieves state-of-the-art performance in different datasets compared to prior works in morality inference.
语言模型可以被训练来识别文本的道德情感,为研究道德在人类生活中的作用创造了新的机会。随着人们对语言和道德的兴趣不断增长,已经发布了几个带有道德注释的基础真理数据集。然而,这些数据集在数据收集方法、领域、主题、注释者说明等方面各不相同。在训练期间简单地聚合这些异构数据集可能会产生不能很好地泛化的模型。我们描述了一个数据融合框架,用于在多个异构数据集上进行训练,以提高性能和泛化性。该模型使用域对抗训练来对齐特征空间中的数据集,并使用加权损失函数来处理标签移位。我们表明,与先前的道德推理工作相比,所提出的框架在不同的数据集中实现了最先进的性能。
{"title":"A Data Fusion Framework for Multi-Domain Morality Learning","authors":"Siyi Guo, Negar Mokhberian, Kristina Lerman","doi":"10.1609/icwsm.v17i1.22145","DOIUrl":"https://doi.org/10.1609/icwsm.v17i1.22145","url":null,"abstract":"Language models can be trained to recognize the moral sentiment of text, creating new opportunities to study the role of morality in human life. As interest in language and morality has grown, several ground truth datasets with moral annotations have been released. However, these datasets vary in the method of data collection, domain, topics, instructions for annotators, etc. Simply aggregating such heterogeneous datasets during training can yield models that fail to generalize well. We describe a data fusion framework for training on multiple heterogeneous datasets that improve performance and generalizability. The model uses domain adversarial training to align the datasets in feature space and a weighted loss function to deal with label shift. We show that the proposed framework achieves state-of-the-art performance in different datasets compared to prior works in morality inference.","PeriodicalId":338112,"journal":{"name":"Proceedings of the International AAAI Conference on Web and Social Media","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136040981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Multi-Platform Collection of Social Media Posts about the 2022 U.S. Midterm Elections 关于2022年美国中期选举的社交媒体帖子的多平台收集
Pub Date : 2023-06-02 DOI: 10.1609/icwsm.v17i1.22205
Rachith Aiyappa, Matthew R. DeVerna, Manita Pote, Bao Tran Truong, Wanying Zhao, David Axelrod, Aria Pessianzadeh, Zoher Kachwala, Munjung Kim, Ozgur Can Seckin, Minsuk Kim, Sunny Gandhi, Amrutha Manikonda, Francesco Pierri, Filippo Menczer, Kai-Cheng Yang
Social media are utilized by millions of citizens to discuss important political issues. Politicians use these platforms to connect with the public and broadcast policy positions. Therefore, data from social media has enabled many studies of political discussion. While most analyses are limited to data from individual platforms, people are embedded in a larger information ecosystem spanning multiple social networks. Here we describe and provide access to the Indiana University 2022 U.S. Midterms Multi-Platform Social Media Dataset (MEIU22), a collection of social media posts from Twitter, Facebook, Instagram, Reddit, and 4chan. MEIU22 links to posts about the midterm elections based on a comprehensive list of keywords and tracks the social media accounts of 1,011 candidates from October 1 to December 25, 2022. We also publish the source code of our pipeline to enable similar multi-platform research projects.
数以百万计的公民利用社交媒体来讨论重要的政治问题。政客们利用这些平台与公众建立联系,并宣传自己的政策立场。因此,来自社交媒体的数据使得许多关于政治讨论的研究成为可能。虽然大多数分析仅限于来自单个平台的数据,但人们被嵌入到跨越多个社交网络的更大的信息生态系统中。在这里,我们描述并提供对印第安纳大学2022年美国中期多平台社交媒体数据集(MEIU22)的访问,这是来自Twitter, Facebook, Instagram, Reddit和4chan的社交媒体帖子的集合。meu22根据综合关键词列表链接中期选举相关帖子,并追踪2022年10月1日至12月25日期间1011名候选人的社交媒体账户。我们还发布了管道的源代码,以支持类似的多平台研究项目。
{"title":"A Multi-Platform Collection of Social Media Posts about the 2022 U.S. Midterm Elections","authors":"Rachith Aiyappa, Matthew R. DeVerna, Manita Pote, Bao Tran Truong, Wanying Zhao, David Axelrod, Aria Pessianzadeh, Zoher Kachwala, Munjung Kim, Ozgur Can Seckin, Minsuk Kim, Sunny Gandhi, Amrutha Manikonda, Francesco Pierri, Filippo Menczer, Kai-Cheng Yang","doi":"10.1609/icwsm.v17i1.22205","DOIUrl":"https://doi.org/10.1609/icwsm.v17i1.22205","url":null,"abstract":"Social media are utilized by millions of citizens to discuss important political issues. Politicians use these platforms to connect with the public and broadcast policy positions. Therefore, data from social media has enabled many studies of political discussion. While most analyses are limited to data from individual platforms, people are embedded in a larger information ecosystem spanning multiple social networks. Here we describe and provide access to the Indiana University 2022 U.S. Midterms Multi-Platform Social Media Dataset (MEIU22), a collection of social media posts from Twitter, Facebook, Instagram, Reddit, and 4chan. MEIU22 links to posts about the midterm elections based on a comprehensive list of keywords and tracks the social media accounts of 1,011 candidates from October 1 to December 25, 2022. We also publish the source code of our pipeline to enable similar multi-platform research projects.","PeriodicalId":338112,"journal":{"name":"Proceedings of the International AAAI Conference on Web and Social Media","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136041295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
#RoeOverturned: Twitter Dataset on the Abortion Rights Controversy #罗伊被推翻:关于堕胎权争议的推特数据集
Pub Date : 2023-06-02 DOI: 10.1609/icwsm.v17i1.22207
Rong-Ching Chang, Ashwin Rao, Qiankun Zhong, Magdalena Wojcieszak, Kristina Lerman
On June 24, 2022, the United States Supreme Court overturned landmark rulings made in its 1973 verdict in Roe v. Wade. The justices by way of a majority vote in Dobbs v. Jackson Women's Health Organization, decided that abortion wasn't a constitutional right and returned the issue of abortion to the elected representatives. This decision triggered multiple protests and debates across the US, especially in the context of the midterm elections in November 2022. Given that many citizens use social media platforms to express their views and mobilize for collective action, and given that online debate provides tangible effects on public opinion, political participation, news media coverage, and the political decision-making, it is crucial to understand online discussions surrounding this topic. Toward this end, we present the first large-scale Twitter dataset collected on the abortion rights debate in the United States. We present a set of 74M tweets systematically collected over the course of one year from January 1, 2022 to January 6, 2023.
2022年6月24日,美国最高法院推翻了1973年罗伊诉韦德案的判决。在多布斯诉杰克逊妇女健康组织案中,法官们以多数票决定堕胎不是宪法赋予的权利,并将堕胎问题退回给民选代表。这一决定在美国各地引发了多次抗议和辩论,特别是在2022年11月中期选举的背景下。鉴于许多公民使用社交媒体平台来表达自己的观点和动员集体行动,鉴于网络辩论对公众舆论、政治参与、新闻媒体报道和政治决策产生了切实的影响,理解围绕这一主题的网络讨论至关重要。为此,我们提出了第一个大规模的关于美国堕胎权辩论的Twitter数据集。我们展示了一组从2022年1月1日到2023年1月6日的一年中系统收集的7400万条推文。
{"title":"#RoeOverturned: Twitter Dataset on the Abortion Rights Controversy","authors":"Rong-Ching Chang, Ashwin Rao, Qiankun Zhong, Magdalena Wojcieszak, Kristina Lerman","doi":"10.1609/icwsm.v17i1.22207","DOIUrl":"https://doi.org/10.1609/icwsm.v17i1.22207","url":null,"abstract":"On June 24, 2022, the United States Supreme Court overturned landmark rulings made in its 1973 verdict in Roe v. Wade. The justices by way of a majority vote in Dobbs v. Jackson Women's Health Organization, decided that abortion wasn't a constitutional right and returned the issue of abortion to the elected representatives. This decision triggered multiple protests and debates across the US, especially in the context of the midterm elections in November 2022. Given that many citizens use social media platforms to express their views and mobilize for collective action, and given that online debate provides tangible effects on public opinion, political participation, news media coverage, and the political decision-making, it is crucial to understand online discussions surrounding this topic. Toward this end, we present the first large-scale Twitter dataset collected on the abortion rights debate in the United States. We present a set of 74M tweets systematically collected over the course of one year from January 1, 2022 to January 6, 2023.","PeriodicalId":338112,"journal":{"name":"Proceedings of the International AAAI Conference on Web and Social Media","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135909940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Proceedings of the International AAAI Conference on Web and Social Media
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1