首页 > 最新文献

Proceedings of the ... International AAAI Conference on Weblogs and Social Media. International AAAI Conference on Weblogs and Social Media最新文献

英文 中文
Community Identity and User Engagement in a Multi-Community Landscape. 多社区景观中的社区身份和用户参与。
Justine Zhang, William L Hamilton, Cristian Danescu-Niculescu-Mizil, Dan Jurafsky, Jure Leskovec

A community's identity defines and shapes its internal dynamics. Our current understanding of this interplay is mostly limited to glimpses gathered from isolated studies of individual communities. In this work we provide a systematic exploration of the nature of this relation across a wide variety of online communities. To this end we introduce a quantitative, language-based typology reflecting two key aspects of a community's identity: how distinctive, and how temporally dynamic it is. By mapping almost 300 Reddit communities into the landscape induced by this typology, we reveal regularities in how patterns of user engagement vary with the characteristics of a community. Our results suggest that the way new and existing users engage with a community depends strongly and systematically on the nature of the collective identity it fosters, in ways that are highly consequential to community maintainers. For example, communities with distinctive and highly dynamic identities are more likely to retain their users. However, such niche communities also exhibit much larger acculturation gaps between existing users and newcomers, which potentially hinder the integration of the latter. More generally, our methodology reveals differences in how various social phenomena manifest across communities, and shows that structuring the multi-community landscape can lead to a better understanding of the systematic nature of this diversity.

一个社区的身份定义并塑造了它的内部动态。我们目前对这种相互作用的理解主要局限于从个别社区的孤立研究中收集的一瞥。在这项工作中,我们在各种各样的在线社区中对这种关系的本质进行了系统的探索。为此,我们引入了一种定量的、基于语言的类型学,反映了社区身份的两个关键方面:如何与众不同,以及它在时间上是如何动态的。通过将近300个Reddit社区映射到这种类型的景观中,我们揭示了用户参与模式如何随社区特征而变化的规律。我们的研究结果表明,新用户和现有用户与社区互动的方式强烈而系统地依赖于它所培养的集体身份的性质,这对社区维护者来说是非常重要的。例如,具有独特和高度动态身份的社区更有可能留住用户。然而,这样的小众社区在现有用户和新用户之间也表现出更大的文化适应差距,这可能会阻碍后者的整合。更一般地说,我们的方法揭示了不同社会现象在不同社区中表现的差异,并表明构建多社区景观可以更好地理解这种多样性的系统本质。
{"title":"Community Identity and User Engagement in a Multi-Community Landscape.","authors":"Justine Zhang,&nbsp;William L Hamilton,&nbsp;Cristian Danescu-Niculescu-Mizil,&nbsp;Dan Jurafsky,&nbsp;Jure Leskovec","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>A community's identity defines and shapes its internal dynamics. Our current understanding of this interplay is mostly limited to glimpses gathered from isolated studies of individual communities. In this work we provide a systematic exploration of the nature of this relation across a wide variety of online communities. To this end we introduce a quantitative, language-based typology reflecting two key aspects of a community's identity: how <i>distinctive</i>, and how temporally <i>dynamic</i> it is. By mapping almost 300 Reddit communities into the landscape induced by this typology, we reveal regularities in how patterns of user engagement vary with the characteristics of a community. Our results suggest that the way new and existing users engage with a community depends strongly and systematically on the nature of the collective identity it fosters, in ways that are highly consequential to community maintainers. For example, communities with distinctive and highly dynamic identities are more likely to retain their users. However, such niche communities also exhibit much larger acculturation gaps between existing users and newcomers, which potentially hinder the integration of the latter. More generally, our methodology reveals differences in how various social phenomena manifest across communities, and shows that structuring the multi-community landscape can lead to a better understanding of the systematic nature of this diversity.</p>","PeriodicalId":74525,"journal":{"name":"Proceedings of the ... International AAAI Conference on Weblogs and Social Media. International AAAI Conference on Weblogs and Social Media","volume":"2017 ","pages":"377-386"},"PeriodicalIF":0.0,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5774974/pdf/nihms933918.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35754496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Loyalty in Online Communities 网络社区的忠诚度
William L. Hamilton, Justine Zhang, Cristian Danescu-Niculescu-Mizil, Dan Jurafsky, J. Leskovec
Loyalty is an essential component of multi-community engagement. When users have the choice to engage with a variety of different communities, they often become loyal to just one, focusing on that community at the expense of others. However, it is unclear how loyalty is manifested in user behavior, or whether certain community characteristics encourage loyalty. In this paper we operationalize loyalty as a user-community relation: users loyal to a community consistently prefer it over all others; loyal communities retain their loyal users over time. By exploring a large set of Reddit communities, we reveal that loyalty is manifested in remarkably consistent behaviors. Loyal users employ language that signals collective identity and engage with more esoteric, less popular content, indicating that they may play a curational role in surfacing new material. Loyal communities have denser user-user interaction networks and lower rates of triadic closure, suggesting that community-level loyalty is associated with more cohesive interactions and less fragmentation into subgroups. We exploit these general patterns to predict future rates of loyalty. Our results show that a user's propensity to become loyal is apparent from their initial interactions with a community, suggesting that some users are intrinsically loyal from the very beginning.
忠诚是多社区参与的重要组成部分。当用户可以选择加入各种不同的社区时,他们通常会只忠于一个社区,而牺牲其他社区。然而,目前尚不清楚忠诚度是如何在用户行为中表现出来的,或者是否某些社区特征鼓励了忠诚度。在本文中,我们将忠诚度作为一种用户-社区关系进行操作:忠诚于一个社区的用户始终比其他社区更喜欢这个社区;随着时间的推移,忠实的社区会留住忠实的用户。通过研究大量的Reddit社区,我们发现忠诚表现在非常一致的行为上。忠诚的用户使用表达集体身份的语言,并参与更深奥、不太流行的内容,这表明他们可能在新材料的出现中发挥策展作用。忠诚的社区有更密集的用户-用户互动网络和更低的三合一关闭率,这表明社区层面的忠诚与更紧密的互动和更少的分裂成子群体有关。我们利用这些一般模式来预测未来的忠诚度。我们的研究结果表明,用户的忠诚倾向从他们最初与社区的互动中就可以明显看出,这表明一些用户从一开始就具有内在的忠诚。
{"title":"Loyalty in Online Communities","authors":"William L. Hamilton, Justine Zhang, Cristian Danescu-Niculescu-Mizil, Dan Jurafsky, J. Leskovec","doi":"10.1609/icwsm.v11i1.14972","DOIUrl":"https://doi.org/10.1609/icwsm.v11i1.14972","url":null,"abstract":"Loyalty is an essential component of multi-community engagement. When users have the choice to engage with a variety of different communities, they often become loyal to just one, focusing on that community at the expense of others. However, it is unclear how loyalty is manifested in user behavior, or whether certain community characteristics encourage loyalty. In this paper we operationalize loyalty as a user-community relation: users loyal to a community consistently prefer it over all others; loyal communities retain their loyal users over time. By exploring a large set of Reddit communities, we reveal that loyalty is manifested in remarkably consistent behaviors. Loyal users employ language that signals collective identity and engage with more esoteric, less popular content, indicating that they may play a curational role in surfacing new material. Loyal communities have denser user-user interaction networks and lower rates of triadic closure, suggesting that community-level loyalty is associated with more cohesive interactions and less fragmentation into subgroups. We exploit these general patterns to predict future rates of loyalty. Our results show that a user's propensity to become loyal is apparent from their initial interactions with a community, suggesting that some users are intrinsically loyal from the very beginning.","PeriodicalId":74525,"journal":{"name":"Proceedings of the ... International AAAI Conference on Weblogs and Social Media. International AAAI Conference on Weblogs and Social Media","volume":"27 1","pages":"540-543"},"PeriodicalIF":0.0,"publicationDate":"2017-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90805480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 76
Social Media Participation in an Activist Movement for Racial Equality. 社会媒体参与种族平等运动。
Munmun De Choudhury, Shagun Jhaver, Benjamin Sugar, Ingmar Weber

From the Arab Spring to the Occupy Movement, social media has been instrumental in driving and supporting socio-political movements throughout the world. In this paper, we present one of the first social media investigations of an activist movement around racial discrimination and police violence, known as "Black Lives Matter". Considering Twitter as a sensor for the broader community's perception of the events related to the movement, we study participation over time, the geographical differences in this participation, and its relationship to protests that unfolded on the ground. We find evidence for continued participation across four temporally separated events related to the movement, with notable changes in engagement and language over time. We also find that participants from regions of historically high rates of black victimization due to police violence tend to express greater negativity and make more references to loss of life. Finally, we observe that social media attributes of affect, behavior and language can predict future protest participation on the ground. We discuss the role of social media in enabling collective action around this unique movement and how social media platforms may help understand perceptions on a socially contested and sensitive issue like race.

从阿拉伯之春到占领运动,社交媒体在推动和支持世界各地的社会政治运动方面发挥了重要作用。在这篇论文中,我们提出了第一个关于围绕种族歧视和警察暴力的激进运动的社交媒体调查之一,被称为“黑人的命也重要”。考虑到Twitter是更广泛的社区对与运动相关的事件感知的传感器,我们研究了随时间的参与,这种参与的地理差异,以及它与地面上展开的抗议活动的关系。我们发现有证据表明,在与该运动相关的四个暂时分开的事件中,人们继续参与,随着时间的推移,参与度和语言发生了显著变化。我们还发现,来自历史上黑人因警察暴力受害率高的地区的参与者倾向于表达更大的消极情绪,并更多地提及生命损失。最后,我们观察到情感、行为和语言的社交媒体属性可以预测未来的抗议参与。我们讨论了社交媒体在推动围绕这一独特运动的集体行动中的作用,以及社交媒体平台如何帮助理解对种族等社会争议和敏感问题的看法。
{"title":"Social Media Participation in an Activist Movement for Racial Equality.","authors":"Munmun De Choudhury,&nbsp;Shagun Jhaver,&nbsp;Benjamin Sugar,&nbsp;Ingmar Weber","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>From the Arab Spring to the Occupy Movement, social media has been instrumental in driving and supporting socio-political movements throughout the world. In this paper, we present one of the first social media investigations of an activist movement around racial discrimination and police violence, known as \"Black Lives Matter\". Considering Twitter as a sensor for the broader community's perception of the events related to the movement, we study participation over time, the geographical differences in this participation, and its relationship to protests that unfolded on the ground. We find evidence for continued participation across four temporally separated events related to the movement, with notable changes in engagement and language over time. We also find that participants from regions of historically high rates of black victimization due to police violence tend to express greater negativity and make more references to loss of life. Finally, we observe that social media attributes of affect, behavior and language can predict future protest participation on the ground. We discuss the role of social media in enabling collective action around this unique movement and how social media platforms may help understand perceptions on a socially contested and sensitive issue like race.</p>","PeriodicalId":74525,"journal":{"name":"Proceedings of the ... International AAAI Conference on Weblogs and Social Media. International AAAI Conference on Weblogs and Social Media","volume":"2016 ","pages":"92-101"},"PeriodicalIF":0.0,"publicationDate":"2016-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5565729/pdf/nihms891348.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35346751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
#PrayForDad: Learning the Semantics Behind Why Social Media Users Disclose Health Information #为爸爸祈祷:学习社交媒体用户披露健康信息背后的语义
Zhijun Yin, You Chen, D. Fabbri, Jimeng Sun, B. Malin
User-generated content in social media is increasingly acknowledged as a rich resource for research into health problems. One particular area of interest is in the semantics individuals evoke because they can influence when health-related information is disclosed. While there have been multiple investigations into why self-disclose occurs, much less is known about when individuals choose to disclose information about other people (e.g., a relative), which is a significant privacy concern. In this paper, we introduce a novel framework to investigate how semantics influence disclosure routines for 34 health issues. This framework begins with a supervised classification model to distinguish tweets that communicate personal health issues from confounding concepts (e.g., metaphorical statements that include a health-related keyword). Next, we annotate tweets for each health issue with linguistic and psychological categories (e.g. social processes, affective processes and personal concerns). Then, we apply a non-negative matrix factorization over a health issue-by-language category space. Finally, the factorized basis space is leveraged to group health issues into natural aggregations based around how they are discussed. We evaluate this framework with four months of tweets (over 200 million) and show that certain semantics correspond with whom a health mention pertains to. Our findings show that health issues related with family members, high medical cost and social support (e.g., Alzheimer's Disease, cancer, and Down syndrome) lead to tweets that are more likely to disclose another individual's health status, while tweets with more benign health issues (e.g., allergy, arthritis, and bronchitis) with biological processes (e.g., health and ingestion) and negative emotions are more likely to contain self-disclosures.
社交媒体上用户生成的内容日益被认为是研究健康问题的丰富资源。一个特别感兴趣的领域是个人唤起的语义,因为他们可以影响何时披露与健康有关的信息。虽然对自我表露的原因已经进行了多次调查,但人们对个人何时选择披露他人(例如,亲戚)的信息知之甚少,这是一个重要的隐私问题。在本文中,我们引入了一个新的框架来研究语义如何影响34个健康问题的披露程序。该框架从一个监督分类模型开始,以区分传达个人健康问题的推文与混淆概念(例如,包含与健康相关关键字的隐喻性陈述)。接下来,我们用语言和心理类别(例如社会过程、情感过程和个人关注)对每个健康问题的推文进行注释。然后,我们应用非负矩阵分解在健康问题的语言类别空间。最后,利用分解基空间根据讨论方式将健康问题分组为自然聚合。我们用四个月的推文(超过2亿条)来评估这个框架,并显示某些语义与健康提及相关的人相对应。我们的研究结果表明,与家庭成员、高医疗成本和社会支持相关的健康问题(如阿尔茨海默病、癌症和唐氏综合症)导致推文更有可能披露另一个人的健康状况,而带有更良性健康问题(如过敏、关节炎和支气管炎)、生物过程(如健康和摄入)和负面情绪的推文更有可能包含自我披露。
{"title":"#PrayForDad: Learning the Semantics Behind Why Social Media Users Disclose Health Information","authors":"Zhijun Yin, You Chen, D. Fabbri, Jimeng Sun, B. Malin","doi":"10.1609/icwsm.v10i1.14735","DOIUrl":"https://doi.org/10.1609/icwsm.v10i1.14735","url":null,"abstract":"User-generated content in social media is increasingly acknowledged as a rich resource for research into health problems. One particular area of interest is in the semantics individuals evoke because they can influence when health-related information is disclosed. While there have been multiple investigations into why self-disclose occurs, much less is known about when individuals choose to disclose information about other people (e.g., a relative), which is a significant privacy concern. In this paper, we introduce a novel framework to investigate how semantics influence disclosure routines for 34 health issues. This framework begins with a supervised classification model to distinguish tweets that communicate personal health issues from confounding concepts (e.g., metaphorical statements that include a health-related keyword). Next, we annotate tweets for each health issue with linguistic and psychological categories (e.g. social processes, affective processes and personal concerns). Then, we apply a non-negative matrix factorization over a health issue-by-language category space. Finally, the factorized basis space is leveraged to group health issues into natural aggregations based around how they are discussed. We evaluate this framework with four months of tweets (over 200 million) and show that certain semantics correspond with whom a health mention pertains to. Our findings show that health issues related with family members, high medical cost and social support (e.g., Alzheimer's Disease, cancer, and Down syndrome) lead to tweets that are more likely to disclose another individual's health status, while tweets with more benign health issues (e.g., allergy, arthritis, and bronchitis) with biological processes (e.g., health and ingestion) and negative emotions are more likely to contain self-disclosures.","PeriodicalId":74525,"journal":{"name":"Proceedings of the ... International AAAI Conference on Weblogs and Social Media. International AAAI Conference on Weblogs and Social Media","volume":"46 1","pages":"456-465"},"PeriodicalIF":0.0,"publicationDate":"2016-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78808031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Social Media Participation in an Activist Movement for Racial Equality 社会媒体参与种族平等运动
M. Choudhury, Shagun Jhaver, Benjamin Sugar, Ingmar Weber
From the Arab Spring to the Occupy Movement, social media has been instrumental in driving and supporting socio-political movements throughout the world. In this paper, we present one of the first social media investigations of an activist movement around racial discrimination and police violence, known as "Black Lives Matter". Considering Twitter as a sensor for the broader community's perception of the events related to the movement, we study participation over time, the geographical differences in this participation, and its relationship to protests that unfolded on the ground. We find evidence for continued participation across four temporally separated events related to the movement, with notable changes in engagement and language over time. We also find that participants from regions of historically high rates of black victimization due to police violence tend to express greater negativity and make more references to loss of life. Finally, we observe that social media attributes of affect, behavior and language can predict future protest participation on the ground. We discuss the role of social media in enabling collective action around this unique movement and how social media platforms may help understand perceptions on a socially contested and sensitive issue like race.
从阿拉伯之春到占领运动,社交媒体在推动和支持世界各地的社会政治运动方面发挥了重要作用。在这篇论文中,我们提出了第一个关于围绕种族歧视和警察暴力的激进运动的社交媒体调查之一,被称为“黑人的命也重要”。考虑到Twitter是更广泛的社区对与运动相关的事件感知的传感器,我们研究了随时间的参与,这种参与的地理差异,以及它与地面上展开的抗议活动的关系。我们发现有证据表明,在与该运动相关的四个暂时分开的事件中,人们继续参与,随着时间的推移,参与度和语言发生了显著变化。我们还发现,来自历史上黑人因警察暴力受害率高的地区的参与者倾向于表达更大的消极情绪,并更多地提及生命损失。最后,我们观察到情感、行为和语言的社交媒体属性可以预测未来的抗议参与。我们讨论了社交媒体在推动围绕这一独特运动的集体行动中的作用,以及社交媒体平台如何帮助理解对种族等社会争议和敏感问题的看法。
{"title":"Social Media Participation in an Activist Movement for Racial Equality","authors":"M. Choudhury, Shagun Jhaver, Benjamin Sugar, Ingmar Weber","doi":"10.1609/icwsm.v10i1.14758","DOIUrl":"https://doi.org/10.1609/icwsm.v10i1.14758","url":null,"abstract":"From the Arab Spring to the Occupy Movement, social media has been instrumental in driving and supporting socio-political movements throughout the world. In this paper, we present one of the first social media investigations of an activist movement around racial discrimination and police violence, known as \"Black Lives Matter\". Considering Twitter as a sensor for the broader community's perception of the events related to the movement, we study participation over time, the geographical differences in this participation, and its relationship to protests that unfolded on the ground. We find evidence for continued participation across four temporally separated events related to the movement, with notable changes in engagement and language over time. We also find that participants from regions of historically high rates of black victimization due to police violence tend to express greater negativity and make more references to loss of life. Finally, we observe that social media attributes of affect, behavior and language can predict future protest participation on the ground. We discuss the role of social media in enabling collective action around this unique movement and how social media platforms may help understand perceptions on a socially contested and sensitive issue like race.","PeriodicalId":74525,"journal":{"name":"Proceedings of the ... International AAAI Conference on Weblogs and Social Media. International AAAI Conference on Weblogs and Social Media","volume":"24 1","pages":"92-101"},"PeriodicalIF":0.0,"publicationDate":"2016-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73741266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 122
Psychological Effects of Urban Crime Gleaned from Social Media. 从社交媒体收集的城市犯罪心理影响。
José Manuel Delgado Valdes, Jacob Eisenstein, Munmun De Choudhury

Exposure to frequent crime incidents has been found to have a negative bearing on the well-being of city residents, even if they are not themselves a direct victim. We pursue the research question of whether naturalistic data shared on Twitter may provide a "lens" to understand changes in psychological attributes of urban communities (1) immediately following crime incidents, as well as (2) due to long-term exposure to crime. We analyze half a million Twitter posts from the City of Atlanta in 2014, where the rate of violent crime is three times of the national average. In a first study, we develop a statistical method to detect changes in social media psychological attributes in the immediate aftermath of a crime event. Second, we develop a regression model that uses historical (yearlong) crime to predict Twitter negative emotion, anxiety, anger, and sadness. We do not find significant changes in social media affect immediately following crime in Atlanta. However we do observe significant ability of historical crime to account for heightened negative emotion and anger in the future. Our findings have implications in gauging the utility of social media to infer longitudinal and population-scale patterns of urban well-being.

研究发现,频繁的犯罪事件对城市居民的幸福感有负面影响,即使他们自己不是直接受害者。我们追求的研究问题是,Twitter上分享的自然主义数据是否可以提供一个“镜头”来理解城市社区(1)犯罪事件发生后的心理属性变化,以及(2)由于长期暴露于犯罪。我们分析了2014年亚特兰大市的50万条推特帖子,那里的暴力犯罪率是全国平均水平的三倍。在第一项研究中,我们开发了一种统计方法来检测犯罪事件发生后社交媒体心理属性的变化。其次,我们开发了一个回归模型,使用历史(一年)犯罪来预测Twitter的负面情绪,焦虑,愤怒和悲伤。我们没有发现亚特兰大犯罪发生后社交媒体影响的显著变化。然而,我们确实观察到历史犯罪在解释未来的负面情绪和愤怒方面的显著能力。我们的研究结果对衡量社交媒体的效用,以推断城市幸福感的纵向和人口规模模式具有启示意义。
{"title":"Psychological Effects of Urban Crime Gleaned from Social Media.","authors":"José Manuel Delgado Valdes,&nbsp;Jacob Eisenstein,&nbsp;Munmun De Choudhury","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Exposure to frequent crime incidents has been found to have a negative bearing on the well-being of city residents, even if they are not themselves a direct victim. We pursue the research question of whether naturalistic data shared on Twitter may provide a \"lens\" to understand changes in psychological attributes of urban communities (1) immediately following crime incidents, as well as (2) due to long-term exposure to crime. We analyze half a million Twitter posts from the City of Atlanta in 2014, where the rate of violent crime is three times of the national average. In a first study, we develop a statistical method to detect changes in social media psychological attributes in the immediate aftermath of a crime event. Second, we develop a regression model that uses historical (yearlong) crime to predict Twitter negative emotion, anxiety, anger, and sadness. We do not find significant changes in social media affect immediately following crime in Atlanta. However we do observe significant ability of historical crime to account for heightened negative emotion and anger in the future. Our findings have implications in gauging the utility of social media to infer longitudinal and population-scale patterns of urban well-being.</p>","PeriodicalId":74525,"journal":{"name":"Proceedings of the ... International AAAI Conference on Weblogs and Social Media. International AAAI Conference on Weblogs and Social Media","volume":"2015 ","pages":"598-601"},"PeriodicalIF":0.0,"publicationDate":"2015-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5648364/pdf/nihms703921.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35535751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Eliciting Disease Data from Wikipedia Articles. 从维基百科文章中获取疾病数据。
Geoffrey Fairchild, Sara Y Del Valle, Lalindra De Silva, Alberto M Segre

Traditional disease surveillance systems suffer from several disadvantages, including reporting lags and antiquated technology, that have caused a movement towards internet-based disease surveillance systems. Internet systems are particularly attractive for disease outbreaks because they can provide data in near real-time and can be verified by individuals around the globe. However, most existing systems have focused on disease monitoring and do not provide a data repository for policy makers or researchers. In order to fill this gap, we analyzed Wikipedia article content. We demonstrate how a named-entity recognizer can be trained to tag case counts, death counts, and hospitalization counts in the article narrative that achieves an F1 score of 0.753. We also show, using the 2014 West African Ebola virus disease epidemic article as a case study, that there are detailed time series data that are consistently updated that closely align with ground truth data. We argue that Wikipedia can be used to create the first community-driven open-source emerging disease detection, monitoring, and repository system.

传统的疾病监测系统存在一些缺点,包括报告滞后和技术过时,这导致了向基于互联网的疾病监测系统的转变。互联网系统对疾病暴发特别有吸引力,因为它们可以提供近乎实时的数据,并且可以由全球各地的个人进行验证。然而,大多数现有的系统都侧重于疾病监测,没有为决策者或研究人员提供数据存储库。为了填补这一空白,我们分析了维基百科的文章内容。我们演示了如何训练命名实体识别器来标记文章叙述中的病例数、死亡数和住院数,从而获得0.753的F1分数。我们还以2014年西非埃博拉病毒病流行文章为例研究表明,有详细的时间序列数据不断更新,与地面真实数据密切一致。我们认为维基百科可以用来创建第一个社区驱动的开源新兴疾病检测、监测和存储系统。
{"title":"Eliciting Disease Data from Wikipedia Articles.","authors":"Geoffrey Fairchild, Sara Y Del Valle, Lalindra De Silva, Alberto M Segre","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Traditional disease surveillance systems suffer from several disadvantages, including reporting lags and antiquated technology, that have caused a movement towards internet-based disease surveillance systems. Internet systems are particularly attractive for disease outbreaks because they can provide data in near real-time and can be verified by individuals around the globe. However, most existing systems have focused on disease monitoring and do not provide a data repository for policy makers or researchers. In order to fill this gap, we analyzed Wikipedia article content. We demonstrate how a named-entity recognizer can be trained to tag case counts, death counts, and hospitalization counts in the article narrative that achieves an F1 score of 0.753. We also show, using the 2014 West African Ebola virus disease epidemic article as a case study, that there are detailed time series data that are consistently updated that closely align with ground truth data. We argue that Wikipedia can be used to create the first community-driven open-source emerging disease detection, monitoring, and repository system.</p>","PeriodicalId":74525,"journal":{"name":"Proceedings of the ... International AAAI Conference on Weblogs and Social Media. International AAAI Conference on Weblogs and Social Media","volume":"2015 ","pages":"26-33"},"PeriodicalIF":0.0,"publicationDate":"2015-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5511739/pdf/nihms875513.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35180937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Psychological Effects of Urban Crime Gleaned from Social Media 从社交媒体收集的城市犯罪心理影响
Jose Manuel Delgado Valdes, Jacob Eisenstein, M. Choudhury
Exposure to frequent crime incidents has been found to have a negative bearing on the well-being of city residents, even if they are not themselves a direct victim. We pursue the research question of whether naturalistic data shared on Twitter may provide a "lens" to understand changes in psychological attributes of urban communities (1) immediately following crime incidents, as well as (2) due to long-term exposure to crime. We analyze half a million Twitter posts from the City of Atlanta in 2014, where the rate of violent crime is three times of the national average. In a first study, we develop a statistical method to detect changes in social media psychological attributes in the immediate aftermath of a crime event. Second, we develop a regression model that uses historical (yearlong) crime to predict Twitter negative emotion, anxiety, anger, and sadness. We do not find significant changes in social media affect immediately following crime in Atlanta. However we do observe significant ability of historical crime to account for heightened negative emotion and anger in the future. Our findings have implications in gauging the utility of social media to infer longitudinal and population-scale patterns of urban well-being.
研究发现,频繁的犯罪事件对城市居民的幸福感有负面影响,即使他们自己不是直接受害者。我们追求的研究问题是,Twitter上分享的自然主义数据是否可以提供一个“镜头”来理解城市社区(1)犯罪事件发生后的心理属性变化,以及(2)由于长期暴露于犯罪。我们分析了2014年亚特兰大市的50万条推特帖子,那里的暴力犯罪率是全国平均水平的三倍。在第一项研究中,我们开发了一种统计方法来检测犯罪事件发生后社交媒体心理属性的变化。其次,我们开发了一个回归模型,使用历史(一年)犯罪来预测Twitter的负面情绪,焦虑,愤怒和悲伤。我们没有发现亚特兰大犯罪发生后社交媒体影响的显著变化。然而,我们确实观察到历史犯罪在解释未来的负面情绪和愤怒方面的显著能力。我们的研究结果对衡量社交媒体的效用,以推断城市幸福感的纵向和人口规模模式具有启示意义。
{"title":"Psychological Effects of Urban Crime Gleaned from Social Media","authors":"Jose Manuel Delgado Valdes, Jacob Eisenstein, M. Choudhury","doi":"10.1609/icwsm.v9i1.14665","DOIUrl":"https://doi.org/10.1609/icwsm.v9i1.14665","url":null,"abstract":"Exposure to frequent crime incidents has been found to have a negative bearing on the well-being of city residents, even if they are not themselves a direct victim. We pursue the research question of whether naturalistic data shared on Twitter may provide a \"lens\" to understand changes in psychological attributes of urban communities (1) immediately following crime incidents, as well as (2) due to long-term exposure to crime. We analyze half a million Twitter posts from the City of Atlanta in 2014, where the rate of violent crime is three times of the national average. In a first study, we develop a statistical method to detect changes in social media psychological attributes in the immediate aftermath of a crime event. Second, we develop a regression model that uses historical (yearlong) crime to predict Twitter negative emotion, anxiety, anger, and sadness. We do not find significant changes in social media affect immediately following crime in Atlanta. However we do observe significant ability of historical crime to account for heightened negative emotion and anger in the future. Our findings have implications in gauging the utility of social media to infer longitudinal and population-scale patterns of urban well-being.","PeriodicalId":74525,"journal":{"name":"Proceedings of the ... International AAAI Conference on Weblogs and Social Media. International AAAI Conference on Weblogs and Social Media","volume":"69 1","pages":"598-601"},"PeriodicalIF":0.0,"publicationDate":"2015-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82608958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Eliciting Disease Data from Wikipedia Articles 从维基百科文章中获取疾病数据
Geoffrey Fairchild, Lalindra De Silva, S. D. Valle, Alberto Maria Segre
Traditional disease surveillance systems suffer from several disadvantages, including reporting lags and antiquated technology, that have caused a movement towards internet-based disease surveillance systems. Internet systems are particularly attractive for disease outbreaks because they can provide data in near real-time and can be verified by individuals around the globe. However, most existing systems have focused on disease monitoring and do not provide a data repository for policy makers or researchers. In order to fill this gap, we analyzed Wikipedia article content. We demonstrate how a named-entity recognizer can be trained to tag case counts, death counts, and hospitalization counts in the article narrative that achieves an F1 score of 0.753. We also show, using the 2014 West African Ebola virus disease epidemic article as a case study, that there are detailed time series data that are consistently updated that closely align with ground truth data. We argue that Wikipedia can be used to create the first community-driven open-source emerging disease detection, monitoring, and repository system.
传统的疾病监测系统存在一些缺点,包括报告滞后和技术过时,这导致了向基于互联网的疾病监测系统的转变。互联网系统对疾病暴发特别有吸引力,因为它们可以提供近乎实时的数据,并且可以由全球各地的个人进行验证。然而,大多数现有的系统都侧重于疾病监测,没有为决策者或研究人员提供数据存储库。为了填补这一空白,我们分析了维基百科的文章内容。我们演示了如何训练命名实体识别器来标记文章叙述中的病例数、死亡数和住院数,从而获得0.753的F1分数。我们还以2014年西非埃博拉病毒病流行文章为例研究表明,有详细的时间序列数据不断更新,与地面真实数据密切一致。我们认为维基百科可以用来创建第一个社区驱动的开源新兴疾病检测、监测和存储系统。
{"title":"Eliciting Disease Data from Wikipedia Articles","authors":"Geoffrey Fairchild, Lalindra De Silva, S. D. Valle, Alberto Maria Segre","doi":"10.5210/OJPHI.V8I1.6526","DOIUrl":"https://doi.org/10.5210/OJPHI.V8I1.6526","url":null,"abstract":"Traditional disease surveillance systems suffer from several disadvantages, including reporting lags and antiquated technology, that have caused a movement towards internet-based disease surveillance systems. Internet systems are particularly attractive for disease outbreaks because they can provide data in near real-time and can be verified by individuals around the globe. However, most existing systems have focused on disease monitoring and do not provide a data repository for policy makers or researchers. In order to fill this gap, we analyzed Wikipedia article content. We demonstrate how a named-entity recognizer can be trained to tag case counts, death counts, and hospitalization counts in the article narrative that achieves an F1 score of 0.753. We also show, using the 2014 West African Ebola virus disease epidemic article as a case study, that there are detailed time series data that are consistently updated that closely align with ground truth data. We argue that Wikipedia can be used to create the first community-driven open-source emerging disease detection, monitoring, and repository system.","PeriodicalId":74525,"journal":{"name":"Proceedings of the ... International AAAI Conference on Weblogs and Social Media. International AAAI Conference on Weblogs and Social Media","volume":"106 1","pages":"26-33"},"PeriodicalIF":0.0,"publicationDate":"2015-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73523120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
期刊
Proceedings of the ... International AAAI Conference on Weblogs and Social Media. International AAAI Conference on Weblogs and Social Media
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1