Proceedings of the Internet Measurement Conference 2018: Latest Articles


On the Origins of Memes by Means of Fringe Web Communities

Proceedings of the Internet Measurement Conference 2018

Pub Date: 2018-05-31; DOI: 10.1145/3278532.3278550

Savvas Zannettou, T. Caulfield, Jeremy Blackburn, Emiliano De Cristofaro, Michael Sirivianos, G. Stringhini, Guillermo Suarez-Tangil

Internet memes are increasingly used to sway and manipulate public opinion. This prompts the need to study their propagation, evolution, and influence across the Web. In this paper, we detect and measure the propagation of memes across multiple Web communities, using a processing pipeline based on perceptual hashing and clustering techniques, and a dataset of 160M images from 2.6B posts gathered from Twitter, Reddit, 4chan's Politically Incorrect board (/pol/), and Gab, over the course of 13 months. We group the images posted on fringe Web communities (/pol/, Gab, and The_Donald subreddit) into clusters, annotate them using meme metadata obtained from Know Your Meme, and also map images from mainstream communities (Twitter and Reddit) to the clusters. Our analysis provides an assessment of the popularity and diversity of memes in the context of each community, showing, e.g., that racist memes are extremely common in fringe Web communities. We also find a substantial number of politics-related memes on both mainstream and fringe Web communities, supporting media reports that memes might be used to enhance or harm politicians. Finally, we use Hawkes processes to model the interplay between Web communities and quantify their reciprocal influence, finding that /pol/ substantially influences the meme ecosystem with the number of memes it produces, while The_Donald has a higher success rate in pushing them to other communities.
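As a rough illustration of the kind of pipeline this abstract describes, the sketch below hashes tiny grayscale images and groups near-duplicates by Hamming distance. The abstract does not name the specific hashing or clustering algorithms, so the average hash and the union-find grouping used here are stand-ins chosen for brevity. The function names (`average_hash`, `hamming`, `cluster`), the distance threshold, and the toy pixel grids are all assumptions made for illustration.

```python
from typing import Dict, List


def average_hash(pixels: List[List[int]]) -> int:
    """Hash a grayscale grid: one bit per pixel, set when pixel >= mean."""
    flat = [p for row in pixels for p in row]
    mean = sum(flat) / len(flat)
    bits = 0
    for p in flat:
        bits = (bits << 1) | (1 if p >= mean else 0)
    return bits


def hamming(a: int, b: int) -> int:
    """Number of differing bits between two hashes."""
    return bin(a ^ b).count("1")


def cluster(hashes: List[int], max_dist: int) -> List[List[int]]:
    """Group image indices whose hashes are linked by distance <= max_dist."""
    parent = list(range(len(hashes)))

    def find(i: int) -> int:
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(len(hashes)):
        for j in range(i + 1, len(hashes)):
            if hamming(hashes[i], hashes[j]) <= max_dist:
                parent[find(j)] = find(i)

    groups: Dict[int, List[int]] = {}
    for i in range(len(hashes)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Toy 2x2 "images" (hypothetical data): a and b are near-duplicates, c differs.
a = [[10, 200], [10, 200]]
b = [[12, 190], [15, 210]]
c = [[200, 10], [200, 10]]
hashes = [average_hash(img) for img in (a, b, c)]
print(cluster(hashes, max_dist=1))  # [[0, 1], [2]]
```

The pairwise comparison is quadratic in the number of images, which is fine for a toy example but would need an index or a dedicated clustering library at the scale the abstract mentions.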



Citations: 215

A Long Way to the Top: Significance, Structure, and Stability of Internet Top Lists

Proceedings of the Internet Measurement Conference 2018

Pub Date: 2018-05-29; DOI: 10.1145/3278532.3278574

Quirin Scheitle, O. Hohlfeld, Julien Gamba, Jonas Jelten, T. Zimmermann, Stephen D. Strowes, N. Vallina-Rodriguez

A broad range of research areas including Internet measurement, privacy, and network security rely on lists of target domains to be analysed; researchers make use of target lists for reasons of necessity or efficiency. The popular Alexa list of one million domains is a widely used example. Despite their prevalence in research papers, the soundness of top lists has seldom been questioned by the community: little is known about the lists' creation, representativity, potential biases, stability, or overlap between lists. In this study we survey the extent, nature, and evolution of top lists used by research communities. We assess the structure and stability of these lists, and show that rank manipulation is possible for some lists. We also reproduce the results of several scientific studies to assess the impact of using a top list at all, which list specifically, and the date of list creation. We find that (i) top lists generally overestimate results compared to the general population by a significant margin, often even an order of magnitude, and (ii) some top lists have surprising change characteristics, causing high day-to-day fluctuation and leading to result instability. We conclude our paper with specific recommendations on the use of top lists, and how to interpret results based on top lists with caution.
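To make the notion of day-to-day fluctuation concrete, the following minimal sketch compares two daily snapshots of a top list. The metrics shown (share of new entries and Jaccard similarity) are common choices but are assumptions here; the abstract does not say which stability measures the authors use. The function names and the `.example` domains are hypothetical.

```python
from typing import Sequence


def daily_churn(yesterday: Sequence[str], today: Sequence[str]) -> float:
    """Fraction of today's entries that were absent from yesterday's list."""
    if not today:
        return 0.0
    previous = set(yesterday)
    new_entries = [d for d in today if d not in previous]
    return len(new_entries) / len(today)


def jaccard(a: Sequence[str], b: Sequence[str]) -> float:
    """Set overlap between two lists: |A & B| / |A | B|."""
    sa, sb = set(a), set(b)
    if not sa and not sb:
        return 1.0
    return len(sa & sb) / len(sa | sb)


# Hypothetical snapshots of a four-entry top list on consecutive days.
day1 = ["a.example", "b.example", "c.example", "d.example"]
day2 = ["a.example", "c.example", "e.example", "f.example"]

print(daily_churn(day1, day2))  # 0.5
print(round(jaccard(day1, day2), 3))  # 0.333
```

Both measures ignore rank position; a rank-aware comparison would be needed to capture reordering within an otherwise unchanged list.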



Citations: 136

In the IP of the Beholder: Strategies for Active IPv6 Topology Discovery

Proceedings of the Internet Measurement Conference 2018

Pub Date: 2018-05-29; DOI: 10.1145/3278532.3278559

Robert Beverly, Ramakrishnan Durairajan, D. Plonka, Justin P. Rohrer

Existing methods for active topology discovery within the IPv6 Internet largely mirror those of IPv4. In light of the large and sparsely populated address space, in conjunction with aggressive ICMPv6 rate limiting by routers, this work develops a different approach to Internet-wide IPv6 topology mapping. First, we adopt randomized probing techniques in order to distribute probing load, minimize the effects of rate limiting, and probe at higher rates. Second, we extensively analyze the efficiency and efficacy of various IPv6 hitlists and target generation methods when used for topology discovery, and synthesize new target lists based on our empirical results to provide both breadth (coverage across networks) and depth (to find potential subnetting). Employing our probing strategy, we discover more than 1.3M IPv6 router interface addresses from a single vantage point. Finally, we share our prober implementation, synthesized target lists, and discovered IPv6 topology results.
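The idea of randomizing probe order can be sketched as follows: instead of walking each target hop by hop, enumerate every (target, hop limit) pair and visit them in shuffled order, so that consecutive probes are unlikely to elicit responses from the same router. This is only a sketch under stated assumptions. The abstract does not describe the authors' prober internals, and the seeded in-memory shuffle, the function name `randomized_probe_order`, and the documentation-prefix addresses below are all illustrative choices.

```python
import random
from itertools import product
from typing import List, Sequence, Tuple


def randomized_probe_order(
    targets: Sequence[str], max_hop_limit: int, seed: int = 0
) -> List[Tuple[str, int]]:
    """Return every (target, hop_limit) pair exactly once, in shuffled order."""
    pairs = list(product(targets, range(1, max_hop_limit + 1)))
    random.Random(seed).shuffle(pairs)  # seeded so runs are reproducible
    return pairs


# Hypothetical targets drawn from the IPv6 documentation prefix 2001:db8::/32.
targets = ["2001:db8:1::1", "2001:db8:2::1", "2001:db8:3::1"]
order = randomized_probe_order(targets, max_hop_limit=4)

# Each pair would become one probe; here the schedule is only printed.
for target, hop_limit in order[:5]:
    print(f"probe {target} with hop limit {hop_limit}")
```

Holding the full schedule in memory does not scale to Internet-wide target lists; a stateless keyed permutation over the index space is one way to get the same effect without materializing every pair.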


Citations: 51
