使用Graphlets的社交网络模型选择

Q3 Mathematics Internet Mathematics Pub Date : 2012-12-01 DOI:10.1080/15427951.2012.671149

J. Janssen, Matt Hurshman, N. Kalyaniwalla

{"title":"使用Graphlets的社交网络模型选择","authors":"J. Janssen, Matt Hurshman, N. Kalyaniwalla","doi":"10.1080/15427951.2012.671149","DOIUrl":null,"url":null,"abstract":"Several network models have been proposed to explain the link structure observed in online social networks. This paper addresses the problem of choosing the model that best fits a given real-world network. We implement a model-selection method based on unsupervised learning. An alternating decision tree is trained using synthetic graphs generated according to each of the models under consideration. We use a broad array of features, with the aim of representing different structural aspects of the network. Features include the frequency counts of small subgraphs (graphlets) as well as features capturing the degree distribution and small-world property. Our method correctly classifies synthetic graphs, and is robust under perturbations of the graphs. We show that the graphlet counts alone are sufficient in separating the training data, indicating that graphlet counts are a good way of capturing network structure. We tested our approach on four Facebook graphs from various American universities. The models that best fit these data are those that are based on the principle of preferential attachment.","PeriodicalId":38105,"journal":{"name":"Internet Mathematics","volume":"8 1","pages":"338 - 363"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/15427951.2012.671149","citationCount":"54","resultStr":"{\"title\":\"Model Selection for Social Networks Using Graphlets\",\"authors\":\"J. Janssen, Matt Hurshman, N. Kalyaniwalla\",\"doi\":\"10.1080/15427951.2012.671149\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Several network models have been proposed to explain the link structure observed in online social networks. This paper addresses the problem of choosing the model that best fits a given real-world network. We implement a model-selection method based on unsupervised learning. An alternating decision tree is trained using synthetic graphs generated according to each of the models under consideration. We use a broad array of features, with the aim of representing different structural aspects of the network. Features include the frequency counts of small subgraphs (graphlets) as well as features capturing the degree distribution and small-world property. Our method correctly classifies synthetic graphs, and is robust under perturbations of the graphs. We show that the graphlet counts alone are sufficient in separating the training data, indicating that graphlet counts are a good way of capturing network structure. We tested our approach on four Facebook graphs from various American universities. The models that best fit these data are those that are based on the principle of preferential attachment.\",\"PeriodicalId\":38105,\"journal\":{\"name\":\"Internet Mathematics\",\"volume\":\"8 1\",\"pages\":\"338 - 363\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1080/15427951.2012.671149\",\"citationCount\":\"54\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Internet Mathematics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1080/15427951.2012.671149\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Mathematics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Internet Mathematics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/15427951.2012.671149","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Mathematics","Score":null,"Total":0}

引用次数: 54

摘要

人们提出了几个网络模型来解释在线社交网络中观察到的链接结构。本文解决了选择最适合给定现实世界网络的模型的问题。我们实现了一种基于无监督学习的模型选择方法。根据考虑中的每个模型生成的合成图来训练交替决策树。我们使用广泛的特征阵列，目的是表示网络的不同结构方面。特征包括小子图(graphlet)的频率计数，以及捕获度分布和小世界属性的特征。该方法对合成图的分类是正确的，并且在图的摄动下具有鲁棒性。我们证明，单靠graphlet计数就足以分离训练数据，这表明graphlet计数是捕获网络结构的好方法。我们在来自美国不同大学的四个Facebook图表上测试了我们的方法。最适合这些数据的模型是那些基于优先依恋原则的模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Model Selection for Social Networks Using Graphlets

Several network models have been proposed to explain the link structure observed in online social networks. This paper addresses the problem of choosing the model that best fits a given real-world network. We implement a model-selection method based on unsupervised learning. An alternating decision tree is trained using synthetic graphs generated according to each of the models under consideration. We use a broad array of features, with the aim of representing different structural aspects of the network. Features include the frequency counts of small subgraphs (graphlets) as well as features capturing the degree distribution and small-world property. Our method correctly classifies synthetic graphs, and is robust under perturbations of the graphs. We show that the graphlet counts alone are sufficient in separating the training data, indicating that graphlet counts are a good way of capturing network structure. We tested our approach on four Facebook graphs from various American universities. The models that best fit these data are those that are based on the principle of preferential attachment.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Internet Mathematics Mathematics-Applied Mathematics

自引率

0.00%

发文量

期刊最新文献

Graph search via star sampling with and without replacement Preferential Placement for Community Structure Formation A Multi-type Preferential Attachment Tree Editorial Board EOV A Theory of Network Security: Principles of Natural Selection and Combinatorics