首页 > 最新文献

arXiv - CS - Social and Information Networks最新文献

英文 中文
Incorporating Like-Minded Peers to Overcome Friend Data Sparsity in Session-Based Social Recommendations 在基于会话的社交推荐中纳入志同道合的同伴,克服好友数据稀缺问题
Pub Date : 2024-09-04 DOI: arxiv-2409.02702
Chunyan An, Yunhan Li, Qiang Yang, Winston K. G. Seah, Zhixu Li, Conghao Yanga
Session-based Social Recommendation (SSR) leverages social relationshipswithin online networks to enhance the performance of Session-basedRecommendation (SR). However, existing SSR algorithms often encounter thechallenge of ``friend data sparsity''. Moreover, significant discrepancies canexist between the purchase preferences of social network friends and those ofthe target user, reducing the influence of friends relative to the targetuser's own preferences. To address these challenges, this paper introduces theconcept of ``Like-minded Peers'' (LMP), representing users whose preferencesalign with the target user's current session based on their historicalsessions. This is the first work, to our knowledge, that uses LMP to enhancethe modeling of social influence in SSR. This approach not only alleviates theproblem of friend data sparsity but also effectively incorporates users withsimilar preferences to the target user. We propose a novel model namedTransformer Encoder with Graph Attention Aggregator Recommendation (TEGAARec),which includes the TEGAA module and the GAT-based social aggregation module.The TEGAA module captures and merges both long-term and short-term interestsfor target users and LMP users. Concurrently, the GAT-based social aggregationmodule is designed to aggregate the target users' dynamic interests and socialinfluence in a weighted manner. Extensive experiments on four real-worlddatasets demonstrate the efficacy and superiority of our proposed model andablation studies are done to illustrate the contributions of each component inTEGAARec.
基于会话的社交推荐(SSR)利用在线网络中的社交关系来提高基于会话的推荐(SR)的性能。然而,现有的会话社交推荐算法经常遇到 "好友数据稀少 "的挑战。此外,社交网络好友的购买偏好与目标用户的购买偏好之间可能存在巨大差异,从而降低了好友相对于目标用户自身偏好的影响力。为了应对这些挑战,本文引入了 "志同道合的同伴"(LMP)的概念,根据目标用户的历史会话,代表其偏好与目标用户当前会话一致的用户。据我们所知,这是第一项使用 LMP 来增强 SSR 中社会影响力建模的工作。这种方法不仅缓解了好友数据稀少的问题,还有效地将与目标用户具有相似偏好的用户纳入其中。我们提出了一种名为 "图关注聚合推荐"(TEGAARec)的新型模型,其中包括 TEGAA 模块和基于 GAT 的社交聚合模块。同时,基于 GAT 的社交聚合模块旨在以加权方式聚合目标用户的动态兴趣和社交影响力。在四个真实世界数据集上进行的广泛实验证明了我们提出的模型的有效性和优越性,并进行了相关研究,以说明 TEGAARec 中每个组件的贡献。
{"title":"Incorporating Like-Minded Peers to Overcome Friend Data Sparsity in Session-Based Social Recommendations","authors":"Chunyan An, Yunhan Li, Qiang Yang, Winston K. G. Seah, Zhixu Li, Conghao Yanga","doi":"arxiv-2409.02702","DOIUrl":"https://doi.org/arxiv-2409.02702","url":null,"abstract":"Session-based Social Recommendation (SSR) leverages social relationships\u0000within online networks to enhance the performance of Session-based\u0000Recommendation (SR). However, existing SSR algorithms often encounter the\u0000challenge of ``friend data sparsity''. Moreover, significant discrepancies can\u0000exist between the purchase preferences of social network friends and those of\u0000the target user, reducing the influence of friends relative to the target\u0000user's own preferences. To address these challenges, this paper introduces the\u0000concept of ``Like-minded Peers'' (LMP), representing users whose preferences\u0000align with the target user's current session based on their historical\u0000sessions. This is the first work, to our knowledge, that uses LMP to enhance\u0000the modeling of social influence in SSR. This approach not only alleviates the\u0000problem of friend data sparsity but also effectively incorporates users with\u0000similar preferences to the target user. We propose a novel model named\u0000Transformer Encoder with Graph Attention Aggregator Recommendation (TEGAARec),\u0000which includes the TEGAA module and the GAT-based social aggregation module.\u0000The TEGAA module captures and merges both long-term and short-term interests\u0000for target users and LMP users. Concurrently, the GAT-based social aggregation\u0000module is designed to aggregate the target users' dynamic interests and social\u0000influence in a weighted manner. Extensive experiments on four real-world\u0000datasets demonstrate the efficacy and superiority of our proposed model and\u0000ablation studies are done to illustrate the contributions of each component in\u0000TEGAARec.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142227853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting Calls to Action in Multimodal Content: Analysis of the 2021 German Federal Election Campaign on Instagram 检测多模态内容中的行动号召:分析 Instagram 上的 2021 年德国联邦大选活动
Pub Date : 2024-09-04 DOI: arxiv-2409.02690
Michael Achmann-Denkler, Jakob Fehle, Mario Haim, Christian Wolff
This study investigates the automated classification of Calls to Action(CTAs) within the 2021 German Instagram election campaign to advance theunderstanding of mobilization in social media contexts. We analyzed over 2,208Instagram stories and 712 posts using fine-tuned BERT models and OpenAI's GPT-4models. The fine-tuned BERT model incorporating synthetic training dataachieved a macro F1 score of 0.93, demonstrating a robust classificationperformance. Our analysis revealed that 49.58% of Instagram posts and 10.64% ofstories contained CTAs, highlighting significant differences in mobilizationstrategies between these content types. Additionally, we found that FDP and theGreens had the highest prevalence of CTAs in posts, whereas CDU and CSU led instory CTAs.
本研究调查了 2021 年德国 Instagram 选举活动中行动号召(CTA)的自动分类,以促进对社交媒体语境下动员的理解。我们使用微调 BERT 模型和 OpenAI 的 GPT-4 模型分析了超过 2,208 个 Instagram 故事和 712 个帖子。结合合成训练数据的微调 BERT 模型获得了 0.93 的宏观 F1 分数,显示出强大的分类性能。我们的分析表明,49.58% 的 Instagram 帖子和 10.64% 的故事包含 CTA,这突显了这些内容类型在动员策略上的显著差异。此外,我们发现 FDP 和绿党在帖子中使用 CTA 的比例最高,而基民盟和基社盟在故事中使用 CTA 的比例最高。
{"title":"Detecting Calls to Action in Multimodal Content: Analysis of the 2021 German Federal Election Campaign on Instagram","authors":"Michael Achmann-Denkler, Jakob Fehle, Mario Haim, Christian Wolff","doi":"arxiv-2409.02690","DOIUrl":"https://doi.org/arxiv-2409.02690","url":null,"abstract":"This study investigates the automated classification of Calls to Action\u0000(CTAs) within the 2021 German Instagram election campaign to advance the\u0000understanding of mobilization in social media contexts. We analyzed over 2,208\u0000Instagram stories and 712 posts using fine-tuned BERT models and OpenAI's GPT-4\u0000models. The fine-tuned BERT model incorporating synthetic training data\u0000achieved a macro F1 score of 0.93, demonstrating a robust classification\u0000performance. Our analysis revealed that 49.58% of Instagram posts and 10.64% of\u0000stories contained CTAs, highlighting significant differences in mobilization\u0000strategies between these content types. Additionally, we found that FDP and the\u0000Greens had the highest prevalence of CTAs in posts, whereas CDU and CSU led in\u0000story CTAs.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142226902","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
What is Normal? A Big Data Observational Science Model of Anonymized Internet Traffic 什么是正常?匿名互联网流量的大数据观察科学模型
Pub Date : 2024-09-04 DOI: arxiv-2409.03111
Jeremy Kepner, Hayden Jananthan, Michael Jones, William Arcand, David Bestor, William Bergeron, Daniel Burrill, Aydin Buluc, Chansup Byun, Timothy Davis, Vijay Gadepally, Daniel Grant, Michael Houle, Matthew Hubbell, Piotr Luszczek, Lauren Milechin, Chasen Milner, Guillermo Morales, Andrew Morris, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Gabriel Wachman, Charles Yee, Peter Michaleas
Understanding what is normal is a key aspect of protecting a domain. Otherdomains invest heavily in observational science to develop models of normalbehavior to better detect anomalies. Recent advances in high performance graphlibraries, such as the GraphBLAS, coupled with supercomputers enablesprocessing of the trillions of observations required. We leverage this approachto synthesize low-parameter observational models of anonymized Internet trafficwith a high regard for privacy.
了解什么是正常行为是保护域的一个关键方面。其他域在观测科学方面投入了大量资金,以开发正常行为模型,从而更好地检测异常情况。高性能图形库(如 GraphBLAS)的最新进展与超级计算机相结合,可以处理所需的数万亿观测数据。我们利用这种方法合成了匿名互联网流量的低参数观测模型,并高度关注隐私问题。
{"title":"What is Normal? A Big Data Observational Science Model of Anonymized Internet Traffic","authors":"Jeremy Kepner, Hayden Jananthan, Michael Jones, William Arcand, David Bestor, William Bergeron, Daniel Burrill, Aydin Buluc, Chansup Byun, Timothy Davis, Vijay Gadepally, Daniel Grant, Michael Houle, Matthew Hubbell, Piotr Luszczek, Lauren Milechin, Chasen Milner, Guillermo Morales, Andrew Morris, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Gabriel Wachman, Charles Yee, Peter Michaleas","doi":"arxiv-2409.03111","DOIUrl":"https://doi.org/arxiv-2409.03111","url":null,"abstract":"Understanding what is normal is a key aspect of protecting a domain. Other\u0000domains invest heavily in observational science to develop models of normal\u0000behavior to better detect anomalies. Recent advances in high performance graph\u0000libraries, such as the GraphBLAS, coupled with supercomputers enables\u0000processing of the trillions of observations required. We leverage this approach\u0000to synthesize low-parameter observational models of anonymized Internet traffic\u0000with a high regard for privacy.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142214719","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Do We Trust What They Say or What They Do? A Multimodal User Embedding Provides Personalized Explanations 我们相信他们说的话还是做的事?多模态用户嵌入提供个性化解释
Pub Date : 2024-09-04 DOI: arxiv-2409.02965
Zhicheng Ren, Zhiping Xiao, Yizhou Sun
With the rapid development of social media, the importance of analyzingsocial network user data has also been put on the agenda. User representationlearning in social media is a critical area of research, based on which we canconduct personalized content delivery, or detect malicious actors. Being morecomplicated than many other types of data, social network user data hasinherent multimodal nature. Various multimodal approaches have been proposed toharness both text (i.e. post content) and relation (i.e. inter-userinteraction) information to learn user embeddings of higher quality. The adventof Graph Neural Network models enables more end-to-end integration of user textembeddings and user interaction graphs in social networks. However, most ofthose approaches do not adequately elucidate which aspects of the data - textor graph structure information - are more helpful for predicting each specificuser under a particular task, putting some burden on personalized downstreamanalysis and untrustworthy information filtering. We propose a simple yeteffective framework called Contribution-Aware Multimodal User Embedding (CAMUE)for social networks. We have demonstrated with empirical evidence, that ourapproach can provide personalized explainable predictions, automaticallymitigating the impact of unreliable information. We also conducted case studiesto show how reasonable our results are. We observe that for most users, graphstructure information is more trustworthy than text information, but there aresome reasonable cases where text helps more. Our work paves the way for moreexplainable, reliable, and effective social media user embedding which allowsfor better personalized content delivery.
随着社交媒体的快速发展,分析社交网络用户数据的重要性也被提上日程。社交媒体中的用户表征学习是一个重要的研究领域,在此基础上,我们可以进行个性化内容推送或检测恶意行为者。与许多其他类型的数据相比,社交网络用户数据更为复杂,具有固有的多模态特性。为了利用文本(即帖子内容)和关系(即用户间互动)信息来学习更高质量的用户嵌入,人们提出了各种多模态方法。图神经网络模型的出现使用户文本嵌入和用户交互图在社交网络中的端到端整合成为可能。然而,这些方法大多没有充分阐明数据的哪些方面--文本或图结构信息--对预测特定任务下的每个特定用户更有帮助,这给个性化下游分析和不可信信息过滤带来了一定的负担。我们为社交网络提出了一个简单而有效的框架,称为 "贡献感知多模态用户嵌入(CAMUE)"。我们通过实证证明,我们的方法可以提供个性化的可解释预测,自动减轻不可靠信息的影响。我们还进行了案例研究,以证明我们的结果是多么合理。我们发现,对于大多数用户来说,图形结构信息比文本信息更可信,但在某些合理的情况下,文本信息的帮助更大。我们的工作为更可解释、更可靠、更有效的社交媒体用户嵌入铺平了道路,从而可以更好地提供个性化内容。
{"title":"Do We Trust What They Say or What They Do? A Multimodal User Embedding Provides Personalized Explanations","authors":"Zhicheng Ren, Zhiping Xiao, Yizhou Sun","doi":"arxiv-2409.02965","DOIUrl":"https://doi.org/arxiv-2409.02965","url":null,"abstract":"With the rapid development of social media, the importance of analyzing\u0000social network user data has also been put on the agenda. User representation\u0000learning in social media is a critical area of research, based on which we can\u0000conduct personalized content delivery, or detect malicious actors. Being more\u0000complicated than many other types of data, social network user data has\u0000inherent multimodal nature. Various multimodal approaches have been proposed to\u0000harness both text (i.e. post content) and relation (i.e. inter-user\u0000interaction) information to learn user embeddings of higher quality. The advent\u0000of Graph Neural Network models enables more end-to-end integration of user text\u0000embeddings and user interaction graphs in social networks. However, most of\u0000those approaches do not adequately elucidate which aspects of the data - text\u0000or graph structure information - are more helpful for predicting each specific\u0000user under a particular task, putting some burden on personalized downstream\u0000analysis and untrustworthy information filtering. We propose a simple yet\u0000effective framework called Contribution-Aware Multimodal User Embedding (CAMUE)\u0000for social networks. We have demonstrated with empirical evidence, that our\u0000approach can provide personalized explainable predictions, automatically\u0000mitigating the impact of unreliable information. We also conducted case studies\u0000to show how reasonable our results are. We observe that for most users, graph\u0000structure information is more trustworthy than text information, but there are\u0000some reasonable cases where text helps more. Our work paves the way for more\u0000explainable, reliable, and effective social media user embedding which allows\u0000for better personalized content delivery.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142214722","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FastEnsemble: A new scalable ensemble clustering method FastEnsemble:一种新的可扩展集合聚类方法
Pub Date : 2024-09-03 DOI: arxiv-2409.02077
Yasamin Tabatabaee, Eleanor Wedell, Minhyuk Park, Tandy Warnow
Many community detection algorithms are stochastic in nature, and theiroutput can vary based on different input parameters and random seeds. Consensusclustering methods, such as FastConsensus and ECG, combine clusterings frommultiple runs of the same clustering algorithm, in order to improve stabilityand accuracy. In this study we present a new consensus clustering method,FastEnsemble, and show that it provides advantages over both FastConsensus andECG. Furthermore, FastEnsemble is designed for use with any clustering method,and we show results using ourmethod with Leiden optimizing modularity or theConstant Potts model. FastEnsemble is available in Github athttps://github.com/ytabatabaee/fast-ensemble
许多群落检测算法都具有随机性,其输出结果会根据不同的输入参数和随机种子而变化。共识聚类方法,如 FastConsensus 和 ECG,将同一聚类算法多次运行的聚类结果结合起来,以提高稳定性和准确性。在这项研究中,我们提出了一种新的共识聚类方法--FastEnsemble,并证明它比 FastConsensus 和 ECG 都更有优势。此外,FastEnsemble还可以与任何聚类方法一起使用,我们展示了使用莱顿优化模块化或康斯坦茨-波茨模型(Constant Potts model)的结果。FastEnsemble可在Github上下载:https://github.com/ytabatabaee/fast-ensemble。
{"title":"FastEnsemble: A new scalable ensemble clustering method","authors":"Yasamin Tabatabaee, Eleanor Wedell, Minhyuk Park, Tandy Warnow","doi":"arxiv-2409.02077","DOIUrl":"https://doi.org/arxiv-2409.02077","url":null,"abstract":"Many community detection algorithms are stochastic in nature, and their\u0000output can vary based on different input parameters and random seeds. Consensus\u0000clustering methods, such as FastConsensus and ECG, combine clusterings from\u0000multiple runs of the same clustering algorithm, in order to improve stability\u0000and accuracy. In this study we present a new consensus clustering method,\u0000FastEnsemble, and show that it provides advantages over both FastConsensus and\u0000ECG. Furthermore, FastEnsemble is designed for use with any clustering method,\u0000and we show results using ourmethod with Leiden optimizing modularity or the\u0000Constant Potts model. FastEnsemble is available in Github at\u0000https://github.com/ytabatabaee/fast-ensemble","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142214725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Preserving the Ephemeral: Instagram Story Archiving with the Tidal Tales Plugin 保存短暂:用潮汐故事插件存档 Instagram 故事
Pub Date : 2024-09-03 DOI: arxiv-2409.01880
Michael Achmann-Denkler, Christian Wolff
We introduce the Tidal Tales Plugin, a Firefox extension for efficientlycollecting and archiving of Instagram stories, addressing the challenges ofephemeral data in social media research. It enables an automated collection ofstory metadata and media files without risking account bans. It contributes toWeb Science by facilitating expansive, long-term studies with enhanced dataaccess and integrity.
我们介绍 Tidal Tales Plugin,这是一款用于高效收集和归档 Instagram 故事的 Firefox 扩展程序,可解决社交媒体研究中短暂数据带来的挑战。该插件可自动收集故事元数据和媒体文件,而无需冒账户被封的风险。它通过增强数据访问性和完整性,促进了广泛的长期研究,从而为网络科学做出了贡献。
{"title":"Preserving the Ephemeral: Instagram Story Archiving with the Tidal Tales Plugin","authors":"Michael Achmann-Denkler, Christian Wolff","doi":"arxiv-2409.01880","DOIUrl":"https://doi.org/arxiv-2409.01880","url":null,"abstract":"We introduce the Tidal Tales Plugin, a Firefox extension for efficiently\u0000collecting and archiving of Instagram stories, addressing the challenges of\u0000ephemeral data in social media research. It enables an automated collection of\u0000story metadata and media files without risking account bans. It contributes to\u0000Web Science by facilitating expansive, long-term studies with enhanced data\u0000access and integrity.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142214917","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fair Railway Network Design 公平铁路网设计
Pub Date : 2024-09-03 DOI: arxiv-2409.02152
Zixu He, Sirin Botan, Jérôme Lang, Abdallah Saffidine, Florian Sikora, Silas Workman
When designing a public transportation network in a country, one may want tominimise the sum of travel duration of all inhabitants. This corresponds to apurely utilitarian view and does not involve any fairness consideration, as theresulting network will typically benefit the capital city and/or large centralcities while leaving some peripheral cities behind. On the other hand, a moreegalitarian view will allow some people to travel between peripheral citieswithout having to go through a central city. We define a model, proposealgorithms for computing solution networks, and report on experiments based onreal data.
在设计一个国家的公共交通网络时,人们可能希望将所有居民的旅行时间总和最小化。这相当于纯粹的功利主义观点,不涉及任何公平考虑,因为设计出的网络通常会有利于首都和/或大型中心城市,而将一些边缘城市甩在身后。另一方面,一种更公平的观点则允许一些人在外围城市之间旅行,而无需经过中心城市。我们定义了一个模型,提出了计算解决方案网络的算法,并报告了基于真实数据的实验结果。
{"title":"Fair Railway Network Design","authors":"Zixu He, Sirin Botan, Jérôme Lang, Abdallah Saffidine, Florian Sikora, Silas Workman","doi":"arxiv-2409.02152","DOIUrl":"https://doi.org/arxiv-2409.02152","url":null,"abstract":"When designing a public transportation network in a country, one may want to\u0000minimise the sum of travel duration of all inhabitants. This corresponds to a\u0000purely utilitarian view and does not involve any fairness consideration, as the\u0000resulting network will typically benefit the capital city and/or large central\u0000cities while leaving some peripheral cities behind. On the other hand, a more\u0000egalitarian view will allow some people to travel between peripheral cities\u0000without having to go through a central city. We define a model, propose\u0000algorithms for computing solution networks, and report on experiments based on\u0000real data.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142214724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Wikipedia in Wartime: Experiences of Wikipedians Maintaining Articles About the Russia-Ukraine War 战时的维基百科:维基人维护俄乌战争相关文章的经验
Pub Date : 2024-09-03 DOI: arxiv-2409.02304
Laura Kurek, Ceren Budak, Eric Gilbert
How do Wikipedians maintain an accurate encyclopedia during an ongoinggeopolitical conflict where state actors might seek to spread disinformation orconduct an information operation? In the context of the Russia-Ukraine War,this question becomes more pressing, given the Russian government's extensivehistory of orchestrating information campaigns. We conducted an interview studywith 13 expert Wikipedians involved in the Russo-Ukrainian War topic area onthe English-language edition of Wikipedia. While our participants did notperceive there to be clear evidence of a state-backed information operation,they agreed that war-related articles experienced high levels of disruptiveediting from both Russia-aligned and Ukraine-aligned accounts. TheEnglish-language edition of Wikipedia had existing policies and processes atits disposal to counter such disruption. State-backed or not, the disruptiveactivity created time-intensive maintenance work for our participants. Finally,participants considered English-language Wikipedia to be more resilient thansocial media in preventing the spread of false information online. We concludeby discussing sociotechnical implications for Wikipedia and social platforms.
在持续的政治冲突中,国家行为者可能会试图散布虚假信息或开展信息行动,维基人该如何维护一部准确的百科全书?在俄乌战争的背景下,鉴于俄罗斯政府在策划信息活动方面的丰富历史,这个问题变得更加紧迫。我们对维基百科英文版中参与俄乌战争主题领域的 13 位维基专家进行了访谈研究。虽然我们的参与者并不认为有明确的证据表明存在国家支持的信息行动,但他们一致认为,与战争相关的文章经历了来自与俄罗斯和乌克兰结盟的账户的大量破坏性编辑。维基百科英文版拥有现有的政策和程序来应对这种破坏。无论是否有国家支持,破坏性活动都为我们的参与者带来了时间密集型的维护工作。最后,参与者认为在防止虚假信息在网上传播方面,英语维基百科比社交媒体更有弹性。最后,我们讨论了社会技术对维基百科和社交平台的影响。
{"title":"Wikipedia in Wartime: Experiences of Wikipedians Maintaining Articles About the Russia-Ukraine War","authors":"Laura Kurek, Ceren Budak, Eric Gilbert","doi":"arxiv-2409.02304","DOIUrl":"https://doi.org/arxiv-2409.02304","url":null,"abstract":"How do Wikipedians maintain an accurate encyclopedia during an ongoing\u0000geopolitical conflict where state actors might seek to spread disinformation or\u0000conduct an information operation? In the context of the Russia-Ukraine War,\u0000this question becomes more pressing, given the Russian government's extensive\u0000history of orchestrating information campaigns. We conducted an interview study\u0000with 13 expert Wikipedians involved in the Russo-Ukrainian War topic area on\u0000the English-language edition of Wikipedia. While our participants did not\u0000perceive there to be clear evidence of a state-backed information operation,\u0000they agreed that war-related articles experienced high levels of disruptive\u0000editing from both Russia-aligned and Ukraine-aligned accounts. The\u0000English-language edition of Wikipedia had existing policies and processes at\u0000its disposal to counter such disruption. State-backed or not, the disruptive\u0000activity created time-intensive maintenance work for our participants. Finally,\u0000participants considered English-language Wikipedia to be more resilient than\u0000social media in preventing the spread of false information online. We conclude\u0000by discussing sociotechnical implications for Wikipedia and social platforms.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142226909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A dataset of Open Source Intelligence (OSINT) Tweets about the Russo-Ukrainian war 有关俄乌战争的开放源代码情报 (OSINT) 推文数据集
Pub Date : 2024-09-02 DOI: arxiv-2409.01052
Johannes Niu, Mila Stillman, Philipp Seeberger, Anna Kruspe
Open Source Intelligence (OSINT) refers to intelligence efforts based onfreely available data. It has become a frequent topic of conversation on socialmedia, where private users or networks can share their findings. Such data ishighly valuable in conflicts, both for gaining a new understanding of thesituation as well as for tracking the spread of misinformation. In this paper,we present a method for collecting such data as well as a novel OSINT datasetfor the Russo-Ukrainian war drawn from Twitter between January 2022 and July2023. It is based on an initial search of users posting OSINT and a subsequentsnowballing approach to detect more. The final dataset contains almost 2million Tweets posted by 1040 users. We also provide some first analyses andexperiments on the data, and make suggestions for its future usage.
开源情报(Open Source Intelligence,OSINT)是指基于可自由获取的数据的情报工作。它已成为社交媒体上的热门话题,私人用户或网络可以在社交媒体上分享他们的发现。这些数据在冲突中具有很高的价值,既能让人对局势有新的认识,又能追踪错误信息的传播。在本文中,我们介绍了一种收集此类数据的方法,并从 2022 年 1 月至 2023 年 7 月期间的 Twitter 中为俄乌战争提供了一个新颖的 OSINT 数据集。该数据集基于对发布 OSINT 的用户的初始搜索,以及随后的 "滚雪球 "方法来检测更多的用户。最终数据集包含 1040 个用户发布的近 200 万条推文。我们还对这些数据进行了初步分析和实验,并对其未来的使用提出了建议。
{"title":"A dataset of Open Source Intelligence (OSINT) Tweets about the Russo-Ukrainian war","authors":"Johannes Niu, Mila Stillman, Philipp Seeberger, Anna Kruspe","doi":"arxiv-2409.01052","DOIUrl":"https://doi.org/arxiv-2409.01052","url":null,"abstract":"Open Source Intelligence (OSINT) refers to intelligence efforts based on\u0000freely available data. It has become a frequent topic of conversation on social\u0000media, where private users or networks can share their findings. Such data is\u0000highly valuable in conflicts, both for gaining a new understanding of the\u0000situation as well as for tracking the spread of misinformation. In this paper,\u0000we present a method for collecting such data as well as a novel OSINT dataset\u0000for the Russo-Ukrainian war drawn from Twitter between January 2022 and July\u00002023. It is based on an initial search of users posting OSINT and a subsequent\u0000snowballing approach to detect more. The final dataset contains almost 2\u0000million Tweets posted by 1040 users. We also provide some first analyses and\u0000experiments on the data, and make suggestions for its future usage.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142214726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Polaris: Sampling from the Multigraph Configuration Model with Prescribed Color Assortativity 北极星从多图谱配置模型中采样,规定颜色同类性
Pub Date : 2024-09-02 DOI: arxiv-2409.01363
Giulia Preti, Matteo Riondato, Aristides Gionis, Gianmarco De Francisci Morales
We introduce Polaris, a network null model for colored multi-graphs thatpreserves the Joint Color Matrix. Polaris is specifically designed for studyingnetwork polarization, where vertices belong to a side in a debate or a partisangroup, represented by a vertex color, and relations have different strengths,represented by an integer-valued edge multiplicity. The key feature of Polarisis preserving the Joint Color Matrix (JCM) of the multigraph, which specifiesthe number of edges connecting vertices of any two given colors. The JCM is thebasic property that determines color assortativity, a fundamental aspect instudying homophily and segregation in polarized networks. By using Polaris,network scientists can test whether a phenomenon is entirely explained by theJCM of the observed network or whether other phenomena might be at play.Technically, our null model is an extension of the configuration model: anensemble of colored multigraphs characterized by the same degree sequence andthe same JCM. To sample from this ensemble, we develop a suite of Markov ChainMonte Carlo algorithms, collectively named Polaris-*. It includes Polaris-B, anadaptation of a generic Metropolis-Hastings algorithm, and Polaris-C, a faster,specialized algorithm with higher acceptance probabilities. This new null modeland the associated algorithms provide a more nuanced toolset for examiningpolarization in social networks, thus enabling statistically sound conclusions.
我们介绍的 Polaris 是一种保留了联合颜色矩阵的彩色多图网络空模型。Polaris 是专为研究网络极化而设计的,其中顶点属于辩论中的一方或党派,用顶点颜色表示,而关系具有不同的强度,用整数值的边倍率表示。Polaris 的主要特点是保留多图的联合颜色矩阵(JCM),它规定了连接任意两种给定颜色顶点的边的数量。联合颜色矩阵是决定颜色同类性的基本属性,是研究极化网络中同质性和隔离性的一个基本方面。通过使用 Polaris,网络科学家们可以检验一种现象是否完全可以用观察到的网络的 JCM 来解释,或者是否有其他现象在起作用。从技术上讲,我们的空模型是配置模型的扩展:由具有相同度序列和相同 JCM 的彩色多图组成的集合。为了从这个集合中采样,我们开发了一套马尔可夫链蒙特卡洛算法,统称为 Polaris-*。它包括 Polaris-B(通用 Metropolis-Hastings 算法的改编版)和 Polaris-C(速度更快、接受概率更高的专用算法)。这一新的空模型和相关算法为研究社交网络中的极化现象提供了一个更细致的工具集,从而能够得出统计上合理的结论。
{"title":"Polaris: Sampling from the Multigraph Configuration Model with Prescribed Color Assortativity","authors":"Giulia Preti, Matteo Riondato, Aristides Gionis, Gianmarco De Francisci Morales","doi":"arxiv-2409.01363","DOIUrl":"https://doi.org/arxiv-2409.01363","url":null,"abstract":"We introduce Polaris, a network null model for colored multi-graphs that\u0000preserves the Joint Color Matrix. Polaris is specifically designed for studying\u0000network polarization, where vertices belong to a side in a debate or a partisan\u0000group, represented by a vertex color, and relations have different strengths,\u0000represented by an integer-valued edge multiplicity. The key feature of Polaris\u0000is preserving the Joint Color Matrix (JCM) of the multigraph, which specifies\u0000the number of edges connecting vertices of any two given colors. The JCM is the\u0000basic property that determines color assortativity, a fundamental aspect in\u0000studying homophily and segregation in polarized networks. By using Polaris,\u0000network scientists can test whether a phenomenon is entirely explained by the\u0000JCM of the observed network or whether other phenomena might be at play.\u0000Technically, our null model is an extension of the configuration model: an\u0000ensemble of colored multigraphs characterized by the same degree sequence and\u0000the same JCM. To sample from this ensemble, we develop a suite of Markov Chain\u0000Monte Carlo algorithms, collectively named Polaris-*. It includes Polaris-B, an\u0000adaptation of a generic Metropolis-Hastings algorithm, and Polaris-C, a faster,\u0000specialized algorithm with higher acceptance probabilities. This new null model\u0000and the associated algorithms provide a more nuanced toolset for examining\u0000polarization in social networks, thus enabling statistically sound conclusions.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142214729","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - CS - Social and Information Networks
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1