首页 > 最新文献

Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining最新文献

英文 中文
Characterizing video-based online information environment using VTracker 利用VTracker表征基于视频的在线信息环境
Thomas Marcoux, Oluwaseyi Adeliyi, Dr Nidhi Agarwal
YouTube is the second most popular website on the internet and a major actor in information propagation, therefore making it efficient as a potential vehicle of misinformation. Current tools available for video platforms tend to hyperfocus on metadata aggregation and neglect the analysis of the actual videos. In an attempt to provide analysts the tools they need to perform various research (behavioral, political analysis, sociology,etc.), we present VTracker (formerly YouTubeTracker), an online analytical tool. Some of the insight analysts can derive from this tool are inorganic behavior detection and algorithmic manipulation. We aim to make the analysis of YouTube content and user behavior accessible not only to information scientists but also communication researchers, journalists, sociologists, and many more. We demonstrate the utility of the tool through some real world data samples.
YouTube是互联网上第二大最受欢迎的网站,也是信息传播的主要参与者,因此它作为错误信息的潜在载体是有效的。目前用于视频平台的工具往往过于关注元数据聚合,而忽略了对实际视频的分析。为了提供分析人员进行各种研究(行为,政治分析,社会学等)所需的工具,我们提出了VTracker(以前的YouTubeTracker),一个在线分析工具。分析人员可以从这个工具中获得一些洞察力,包括无机行为检测和算法操作。我们的目标是使对YouTube内容和用户行为的分析不仅对信息科学家,而且对传播研究人员、记者、社会学家等更多人开放。我们通过一些真实世界的数据示例来演示该工具的实用性。
{"title":"Characterizing video-based online information environment using VTracker","authors":"Thomas Marcoux, Oluwaseyi Adeliyi, Dr Nidhi Agarwal","doi":"10.1145/3487351.3489480","DOIUrl":"https://doi.org/10.1145/3487351.3489480","url":null,"abstract":"YouTube is the second most popular website on the internet and a major actor in information propagation, therefore making it efficient as a potential vehicle of misinformation. Current tools available for video platforms tend to hyperfocus on metadata aggregation and neglect the analysis of the actual videos. In an attempt to provide analysts the tools they need to perform various research (behavioral, political analysis, sociology,etc.), we present VTracker (formerly YouTubeTracker), an online analytical tool. Some of the insight analysts can derive from this tool are inorganic behavior detection and algorithmic manipulation. We aim to make the analysis of YouTube content and user behavior accessible not only to information scientists but also communication researchers, journalists, sociologists, and many more. We demonstrate the utility of the tool through some real world data samples.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129514009","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast indexing algorithm for efficient kNN queries on complex networks 复杂网络中高效kNN查询的快速索引算法
Suomi Kobayashi, Shohei Matsugu, Hiroaki Shiokawa
k nearest neighbor (kNN) query is an essential graph data management tool to find relevant data entities suited to a user-specified query node. Graph indexing methods have the potential to achieve a quick kNN search response, the graph indexing methods are one of the promising approaches. However, they struggle to handle large-scale complex networks since constructing indexes and to querying kNN nodes in the large-scale networks are computationally expensive. In this paper, we propose a novel graph indexing algorithm for a fast kNN query on large networks. To overcome the aforementioned limitations, our algorithm generates two types of indexes based on the topological properties of complex networks. Our extensive experiments on real-world graphs clarify that our algorithm achieves up to 18,074 times faster indexing and 146 times faster kNN query than the state-of-the-art methods.
kNN查询是一种重要的图数据管理工具,用于查找适合用户指定查询节点的相关数据实体。图索引方法有可能实现快速的kNN搜索响应,图索引方法是有前途的方法之一。然而,它们很难处理大型复杂网络,因为在大型网络中构造索引和查询kNN节点的计算成本很高。本文针对大型网络上的快速kNN查询,提出了一种新的图索引算法。为了克服上述限制,我们的算法基于复杂网络的拓扑特性生成两种类型的索引。我们在现实世界图上的广泛实验表明,我们的算法比最先进的方法实现了高达18074倍的索引速度和146倍的kNN查询速度。
{"title":"Fast indexing algorithm for efficient kNN queries on complex networks","authors":"Suomi Kobayashi, Shohei Matsugu, Hiroaki Shiokawa","doi":"10.1145/3487351.3489442","DOIUrl":"https://doi.org/10.1145/3487351.3489442","url":null,"abstract":"k nearest neighbor (kNN) query is an essential graph data management tool to find relevant data entities suited to a user-specified query node. Graph indexing methods have the potential to achieve a quick kNN search response, the graph indexing methods are one of the promising approaches. However, they struggle to handle large-scale complex networks since constructing indexes and to querying kNN nodes in the large-scale networks are computationally expensive. In this paper, we propose a novel graph indexing algorithm for a fast kNN query on large networks. To overcome the aforementioned limitations, our algorithm generates two types of indexes based on the topological properties of complex networks. Our extensive experiments on real-world graphs clarify that our algorithm achieves up to 18,074 times faster indexing and 146 times faster kNN query than the state-of-the-art methods.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"135 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128397462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Temporalizing static graph autoencoders to handle temporal networks 时序化静态图形自动编码器来处理时序网络
Mounir Haddad, Cécile Bothorel, P. Lenca, Dominique Bedart
Graph autoencoders (GAE), also known as graph embedding methods, learn latent representations of the nodes of a graph in a low-dimensional space where the structural information is preserved. While real-world graphs are generally dynamic, only a few embedding methods handle the temporal dimension: Even though they have proven their reliability, the majority of the embedding techniques address the case of static networks and present poor performances when applied to temporal ones. In this paper, we present a generic method to temporalize static graph autoencoders, i.e. adapt different static graph embedding methods to the case of temporal networks. This is made possible by learning optimal connections between timesteps' graphs in order to form a single merged spatio-temporal network. We prove that this highly improves the inference tasks' accuracy of the temporalized methods. We also show that the learned connections are directly related to nodes characteristics and can be used beyond the scope of the embedding they are designed for.
图自编码器(GAE),也称为图嵌入方法,在低维空间中学习图节点的潜在表示,其中结构信息被保留。虽然现实世界的图通常是动态的,但只有少数嵌入方法处理时间维度:尽管它们已经证明了它们的可靠性,但大多数嵌入技术处理静态网络的情况,并且在应用于时间网络时表现不佳。本文提出了一种通用的静态图自编码器时间化方法,即根据时间网络的情况,采用不同的静态图嵌入方法。这可以通过学习时间步长图之间的最佳连接来实现,从而形成一个合并的时空网络。我们证明了这极大地提高了时间化方法的推理任务的准确性。我们还表明,学习到的连接与节点特征直接相关,并且可以在其设计的嵌入范围之外使用。
{"title":"Temporalizing static graph autoencoders to handle temporal networks","authors":"Mounir Haddad, Cécile Bothorel, P. Lenca, Dominique Bedart","doi":"10.1145/3487351.3488333","DOIUrl":"https://doi.org/10.1145/3487351.3488333","url":null,"abstract":"Graph autoencoders (GAE), also known as graph embedding methods, learn latent representations of the nodes of a graph in a low-dimensional space where the structural information is preserved. While real-world graphs are generally dynamic, only a few embedding methods handle the temporal dimension: Even though they have proven their reliability, the majority of the embedding techniques address the case of static networks and present poor performances when applied to temporal ones. In this paper, we present a generic method to temporalize static graph autoencoders, i.e. adapt different static graph embedding methods to the case of temporal networks. This is made possible by learning optimal connections between timesteps' graphs in order to form a single merged spatio-temporal network. We prove that this highly improves the inference tasks' accuracy of the temporalized methods. We also show that the learned connections are directly related to nodes characteristics and can be used beyond the scope of the embedding they are designed for.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"205 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116374973","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The stock exchange of influencers: a financial approach for studying fanbase variation trends 影响者的股票交易:研究粉丝基础变化趋势的财务方法
Fabio Bertone, L. Vassio, Martino Trevisan
In many online social networks (OSNs), a limited portion of profiles emerges and reaches a large base of followers, i.e., the so-called social influencers. One of their main goals is to increase their fanbase to increase their visibility, engaging users through their content. In this work, we propose a novel parallel between the ecosystem of OSNs and the stock exchange market. Followers act as private investors, and they follow influencers, i.e., buy stocks, based on their individual preferences and on the information they gather through external sources. In this preliminary study, we show how the approaches proposed in the context of the stock exchange market can be successfully applied to social networks. Our case study focuses on 60 Italian Instagram influencers and shows how their followers short-term trends obtained through Bollinger bands become close to those found in external sources, Google Trends in our case, similarly to phenomena already observed in the financial market. Besides providing a strong correlation between these different trends, our results pose the basis for studying social networks with a new lens, linking them with a different domain.
在许多在线社交网络(OSNs)中,有限部分的个人资料出现并到达大量的追随者,即所谓的社会影响者。他们的主要目标之一是增加他们的粉丝基础,提高他们的知名度,通过他们的内容吸引用户。在这项工作中,我们提出了一种新的osn生态系统与证券交易所市场之间的平行关系。追随者充当私人投资者,他们跟随影响者,即根据他们的个人偏好和通过外部来源收集的信息购买股票。在这项初步研究中,我们展示了在证券交易所市场背景下提出的方法如何成功地应用于社交网络。我们的案例研究聚焦于60位意大利Instagram网红,并展示了他们的追随者通过布林带获得的短期趋势如何与外部来源(在我们的案例中是谷歌趋势)中发现的趋势接近,类似于金融市场中已经观察到的现象。除了提供这些不同趋势之间的强烈相关性之外,我们的研究结果还为用新的视角研究社交网络奠定了基础,将它们与不同的领域联系起来。
{"title":"The stock exchange of influencers: a financial approach for studying fanbase variation trends","authors":"Fabio Bertone, L. Vassio, Martino Trevisan","doi":"10.1145/3487351.3488413","DOIUrl":"https://doi.org/10.1145/3487351.3488413","url":null,"abstract":"In many online social networks (OSNs), a limited portion of profiles emerges and reaches a large base of followers, i.e., the so-called social influencers. One of their main goals is to increase their fanbase to increase their visibility, engaging users through their content. In this work, we propose a novel parallel between the ecosystem of OSNs and the stock exchange market. Followers act as private investors, and they follow influencers, i.e., buy stocks, based on their individual preferences and on the information they gather through external sources. In this preliminary study, we show how the approaches proposed in the context of the stock exchange market can be successfully applied to social networks. Our case study focuses on 60 Italian Instagram influencers and shows how their followers short-term trends obtained through Bollinger bands become close to those found in external sources, Google Trends in our case, similarly to phenomena already observed in the financial market. Besides providing a strong correlation between these different trends, our results pose the basis for studying social networks with a new lens, linking them with a different domain.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125982271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Forward and backward linear threshold ranks 正向和反向线性阈值排序
M. Blesa, Pau García-Rodríguez, M. Serna
We propose the FwLTR and BwLTR, two new centrality measures based on the Linear Threshold model. In contrast to the Linear Threshold rank (LTR), these measures differentiate between the incoming and the outgoing neighborhoods of the activation set that initiates the spreading process. Their rankings are distinguishable from the rest of the centrality measures considered traditionally. However, LTR and BwLTR behave quite similarly, while FwLTR is clearly different.
我们提出了两个新的基于线性阈值模型的中心性测度FwLTR和BwLTR。与线性阈值秩(LTR)相反,这些度量区分启动传播过程的激活集的传入和传出邻域。他们的排名与传统上考虑的其他中心性指标不同。然而,LTR和BwLTR的行为非常相似,而FwLTR则明显不同。
{"title":"Forward and backward linear threshold ranks","authors":"M. Blesa, Pau García-Rodríguez, M. Serna","doi":"10.1145/3487351.3488355","DOIUrl":"https://doi.org/10.1145/3487351.3488355","url":null,"abstract":"We propose the FwLTR and BwLTR, two new centrality measures based on the Linear Threshold model. In contrast to the Linear Threshold rank (LTR), these measures differentiate between the incoming and the outgoing neighborhoods of the activation set that initiates the spreading process. Their rankings are distinguishable from the rest of the centrality measures considered traditionally. However, LTR and BwLTR behave quite similarly, while FwLTR is clearly different.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132348369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Peeking through the homelessness system with a network science lens 从网络科学的角度透视无家可归者系统
C. Chelmis, Khandker Sadia Rahman
This paper models, for the first time, the homelessness system as a network of interconnected services which individuals traverse over time towards securing stable housing, and formalizes the concept of stability upon exit of the system. A computational analysis of individual-level longitudinal homelessness data shows that the ultimate goal is either reached quickly or not at all, regardless of starting conditions, indicating the importance of addressing the homeless' needs early on.
本文首次将无家可归者系统建模为一个相互关联的服务网络,个人在此网络中穿越以获得稳定的住房,并形式化了系统退出时稳定性的概念。对个人层面的纵向无家可归者数据的计算分析表明,无论起始条件如何,最终目标要么很快实现,要么根本无法实现,这表明了尽早解决无家可归者需求的重要性。
{"title":"Peeking through the homelessness system with a network science lens","authors":"C. Chelmis, Khandker Sadia Rahman","doi":"10.1145/3487351.3488321","DOIUrl":"https://doi.org/10.1145/3487351.3488321","url":null,"abstract":"This paper models, for the first time, the homelessness system as a network of interconnected services which individuals traverse over time towards securing stable housing, and formalizes the concept of stability upon exit of the system. A computational analysis of individual-level longitudinal homelessness data shows that the ultimate goal is either reached quickly or not at all, regardless of starting conditions, indicating the importance of addressing the homeless' needs early on.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"141 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132552006","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A mathematical model for friend discovery from dynamic social graphs 动态社交图谱中朋友发现的数学模型
C. Leung, S. Singh
Nowadays, social networking is popular. As such, numerous social networking sites (e.g., Facebook, YouTube, Instagram) are generating very large volumes of social data rapidly. Valuable knowledge and information is embedded into these big social data, and is awaiting to be analyzed and mined via social network analysis and mining. In general, social networks can be represented as graphs. Because of the dynamic nature of social networking, edges and/or vertices keep adding to (or deleting from) the graphs. We present in this paper a mathematical model for friend discovery from dynamic social graphs. In particular, we focus on both linear algebra and graph theory approaches to discover interesting social entities---such as active followers---from dynamic social networks represented as dynamic directional social graphs.
如今,社交网络很受欢迎。因此,许多社交网站(如Facebook、YouTube、Instagram)正在迅速产生大量的社交数据。有价值的知识和信息被嵌入到这些大的社交数据中,等待着通过社交网络分析和挖掘进行分析和挖掘。一般来说,社交网络可以用图形表示。由于社交网络的动态特性,边和/或顶点不断添加(或删除)图。本文提出了一个从动态社交图中发现朋友的数学模型。特别是,我们专注于线性代数和图论方法来发现有趣的社会实体——比如活跃的追随者——从动态定向社交图表示的动态社交网络。
{"title":"A mathematical model for friend discovery from dynamic social graphs","authors":"C. Leung, S. Singh","doi":"10.1145/3487351.3489473","DOIUrl":"https://doi.org/10.1145/3487351.3489473","url":null,"abstract":"Nowadays, social networking is popular. As such, numerous social networking sites (e.g., Facebook, YouTube, Instagram) are generating very large volumes of social data rapidly. Valuable knowledge and information is embedded into these big social data, and is awaiting to be analyzed and mined via social network analysis and mining. In general, social networks can be represented as graphs. Because of the dynamic nature of social networking, edges and/or vertices keep adding to (or deleting from) the graphs. We present in this paper a mathematical model for friend discovery from dynamic social graphs. In particular, we focus on both linear algebra and graph theory approaches to discover interesting social entities---such as active followers---from dynamic social networks represented as dynamic directional social graphs.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134068087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Knowledge graph based platform of COVID-19 drugs and symptoms 基于知识图谱的新型冠状病毒药物与症状研究平台
Zhenhe Pan, Shuang Jiang, Juntao Su, Muzhe Guo, Yuanlin Zhang
Since the first cased of COVID-19 was identified in December 2019, a plethora of different drugs have been tested for COVID-19 treatment, making it a daunting task to keep track of the rapid growth of COVID-19 research landscape. Using the existing scientific literature search systems to develop a deeper understanding of COVID-19 related clinical experiments and results turns to be increasingly complicated. In this paper, we build a named entity recognition-based framework to extract information accurately and generate knowledge graph efficiently from a myriad of clinical test results articles. Of the tested drugs to treat COVID-19, we also develop a question answering system answers to medical questions regarding COVID-19 related symptoms using Wikipedia articles. We combine the state-of-the-art question answering model - Bidirectional Encoder Representations from Transformers (BERT), with Knowledge Graph to answer patients' questions about treatment options for their symptoms. This generated knowledge graph is user-friendly with intuitive and convenient tools to find the supporting and/or contradictory references of certain drugs with properties such as side effects, target population, etc. The trained question answering platform provides a straightforward and error-tolerant way to query for treatment suggestions given uses' input symptoms.
自2019年12月发现第一例COVID-19病例以来,已经测试了大量不同的药物来治疗COVID-19,这使得跟踪COVID-19研究领域的快速增长成为一项艰巨的任务。利用现有的科学文献检索系统来深入了解COVID-19相关的临床实验和结果变得越来越复杂。在本文中,我们建立了一个基于命名实体识别的框架,从大量的临床试验结果文章中准确地提取信息,并高效地生成知识图谱。在治疗COVID-19的测试药物中,我们还开发了一个问答系统,使用维基百科文章回答有关COVID-19相关症状的医学问题。我们结合了最先进的问答模型-双向编码器表示从变压器(BERT),知识图谱来回答病人的问题,关于治疗方案的症状。生成的知识图谱具有用户友好性和直观方便的工具,可以查找具有副作用、目标人群等属性的某些药物的支持和/或矛盾参考文献。经过训练的问答平台提供了一种简单、容错的方式,可以根据用户输入的症状查询治疗建议。
{"title":"Knowledge graph based platform of COVID-19 drugs and symptoms","authors":"Zhenhe Pan, Shuang Jiang, Juntao Su, Muzhe Guo, Yuanlin Zhang","doi":"10.1145/3487351.3489484","DOIUrl":"https://doi.org/10.1145/3487351.3489484","url":null,"abstract":"Since the first cased of COVID-19 was identified in December 2019, a plethora of different drugs have been tested for COVID-19 treatment, making it a daunting task to keep track of the rapid growth of COVID-19 research landscape. Using the existing scientific literature search systems to develop a deeper understanding of COVID-19 related clinical experiments and results turns to be increasingly complicated. In this paper, we build a named entity recognition-based framework to extract information accurately and generate knowledge graph efficiently from a myriad of clinical test results articles. Of the tested drugs to treat COVID-19, we also develop a question answering system answers to medical questions regarding COVID-19 related symptoms using Wikipedia articles. We combine the state-of-the-art question answering model - Bidirectional Encoder Representations from Transformers (BERT), with Knowledge Graph to answer patients' questions about treatment options for their symptoms. This generated knowledge graph is user-friendly with intuitive and convenient tools to find the supporting and/or contradictory references of certain drugs with properties such as side effects, target population, etc. The trained question answering platform provides a straightforward and error-tolerant way to query for treatment suggestions given uses' input symptoms.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130688992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting cyber security related Twitter accounts and different sub-groups: a multi-classifier approach 检测与网络安全相关的Twitter账户和不同的子组:一种多分类方法
Mohamad Imad Mahaini, Shujun Li
Many cyber security experts, organizations, and cyber criminals are active users on online social networks (OSNs). Therefore, detecting cyber security related accounts on OSNs and monitoring their activities can be very useful for different purposes such as cyber threat intelligence, detecting and preventing cyber attacks and online harms on OSNs, and evaluating the effectiveness of cyber security awareness activities on OSNs. In this paper, we report our work on developing several machine learning based classifiers for detecting cyber security related accounts on Twitter, including a base-line classifier for detecting cyber security related accounts in general, and three sub-classifiers for detecting three subsets of cyber security related accounts (individuals, hackers, and academia). To train and test the classifiers, we followed a more systemic approach (based on a cyber security taxonomy, real-time sampling of tweets, and crowdsourcing) to construct a dataset of cyber security related accounts with multiple tags assigned to each account. For each classifier, we considered a richer set of features than those used in past studies. Among five machine learning models tested, the Random Forest model achieved the best performance: 93% for the baseline classifier, 88-91% for the three sub-classifiers. We also studied feature reduction of the base-line classifier and showed that using just six features we can already achieve the same performance.
许多网络安全专家、组织和网络犯罪分子都是在线社交网络(osn)的活跃用户。因此,检测osn上的网络安全相关账户并监控其活动,可以用于网络威胁情报、检测和预防osn上的网络攻击和在线危害、评估osn上网络安全意识活动的有效性等不同目的。在本文中,我们报告了我们开发几个基于机器学习的分类器的工作,这些分类器用于检测Twitter上的网络安全相关帐户,包括用于检测一般网络安全相关帐户的基线分类器,以及用于检测网络安全相关帐户的三个子集(个人,黑客和学术界)的三个子分类器。为了训练和测试分类器,我们采用了一种更系统的方法(基于网络安全分类法、实时tweet采样和众包)来构建网络安全相关帐户的数据集,并为每个帐户分配多个标签。对于每个分类器,我们考虑了比过去研究中使用的更丰富的特征集。在测试的五个机器学习模型中,随机森林模型的性能最好:基线分类器的准确率为93%,三个子分类器的准确率为88-91%。我们还研究了基线分类器的特征约简,并表明仅使用六个特征我们就可以达到相同的性能。
{"title":"Detecting cyber security related Twitter accounts and different sub-groups: a multi-classifier approach","authors":"Mohamad Imad Mahaini, Shujun Li","doi":"10.1145/3487351.3492716","DOIUrl":"https://doi.org/10.1145/3487351.3492716","url":null,"abstract":"Many cyber security experts, organizations, and cyber criminals are active users on online social networks (OSNs). Therefore, detecting cyber security related accounts on OSNs and monitoring their activities can be very useful for different purposes such as cyber threat intelligence, detecting and preventing cyber attacks and online harms on OSNs, and evaluating the effectiveness of cyber security awareness activities on OSNs. In this paper, we report our work on developing several machine learning based classifiers for detecting cyber security related accounts on Twitter, including a base-line classifier for detecting cyber security related accounts in general, and three sub-classifiers for detecting three subsets of cyber security related accounts (individuals, hackers, and academia). To train and test the classifiers, we followed a more systemic approach (based on a cyber security taxonomy, real-time sampling of tweets, and crowdsourcing) to construct a dataset of cyber security related accounts with multiple tags assigned to each account. For each classifier, we considered a richer set of features than those used in past studies. Among five machine learning models tested, the Random Forest model achieved the best performance: 93% for the baseline classifier, 88-91% for the three sub-classifiers. We also studied feature reduction of the base-line classifier and showed that using just six features we can already achieve the same performance.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115541638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
An insight into network structure measures and number of driver nodes 对网络结构、度量和驱动节点数量的洞察
Abida Sadaf, Luke Mathieson, Katarzyna Musial
Control of complex networks is one of the most challenging open problems within network science. One view says that we can only claim to fully understand a network if we have the ability to influence or control it and predict the results of the employed control mechanisms. The area of control and controllability has progressed notably in the past ten years with several frameworks proposed namely, structural, exact, and physical. With continuing advancement in the area, the need to develop effective and efficient control methods that provide robust control is increasingly critical. The ultimate responsibility for controlling the network lies with the set of driver nodes that, according to the classical definition of the control theory of complex systems, can steer the network from any given state to a desired final state. To be able to develop better control mechanisms, we need to understand the relationship between different network structures and the number of driver nodes needed to control a given structure. This will allow understanding of which networks might be easier to control and the resources needed to control them. In this paper, we present a systematic study that builds an understanding of how network profiles (random (R), small-world (SW), scale-free (SF)) influence the number of driver nodes needed for control. Additionally, we also consider real social networks and identify their driver nodes set to further expand the discussion. We mean to find a correlation between network structure measures and number of driver nodes. Our results show that there is in fact a strong relationship between these.
复杂网络的控制是网络科学中最具挑战性的开放性问题之一。一种观点认为,只有当我们有能力影响或控制一个网络,并预测所采用的控制机制的结果时,我们才能声称完全理解这个网络。控制和可控性领域在过去十年中取得了显著进展,提出了几个框架,即结构框架、精确框架和物理框架。随着该领域的不断发展,开发有效和高效的控制方法以提供鲁棒控制的需求变得越来越重要。控制网络的最终责任在于一组驱动节点,根据复杂系统控制理论的经典定义,这些节点可以将网络从任何给定状态引导到期望的最终状态。为了能够开发更好的控制机制,我们需要了解不同网络结构和控制给定结构所需的驱动节点数量之间的关系。这将有助于了解哪些网络可能更容易控制,以及控制它们所需的资源。在本文中,我们提出了一个系统的研究,该研究建立了对网络概况(随机(R),小世界(SW),无标度(SF))如何影响控制所需驱动节点数量的理解。此外,我们还考虑了真实的社交网络,并确定了其驱动节点集,以进一步扩大讨论。我们的目的是找出网络结构措施与驱动节点数量之间的相关性。我们的研究结果表明,事实上,这两者之间存在着很强的关系。
{"title":"An insight into network structure measures and number of driver nodes","authors":"Abida Sadaf, Luke Mathieson, Katarzyna Musial","doi":"10.1145/3487351.3488557","DOIUrl":"https://doi.org/10.1145/3487351.3488557","url":null,"abstract":"Control of complex networks is one of the most challenging open problems within network science. One view says that we can only claim to fully understand a network if we have the ability to influence or control it and predict the results of the employed control mechanisms. The area of control and controllability has progressed notably in the past ten years with several frameworks proposed namely, structural, exact, and physical. With continuing advancement in the area, the need to develop effective and efficient control methods that provide robust control is increasingly critical. The ultimate responsibility for controlling the network lies with the set of driver nodes that, according to the classical definition of the control theory of complex systems, can steer the network from any given state to a desired final state. To be able to develop better control mechanisms, we need to understand the relationship between different network structures and the number of driver nodes needed to control a given structure. This will allow understanding of which networks might be easier to control and the resources needed to control them. In this paper, we present a systematic study that builds an understanding of how network profiles (random (R), small-world (SW), scale-free (SF)) influence the number of driver nodes needed for control. Additionally, we also consider real social networks and identify their driver nodes set to further expand the discussion. We mean to find a correlation between network structure measures and number of driver nodes. Our results show that there is in fact a strong relationship between these.","PeriodicalId":320904,"journal":{"name":"Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining","volume":"402 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114887428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1