Benjamin D. Maldonado, Ryan Schuerkamp, Cassidy M. Martin, Ketra L. Rice, Nisha Nataraj, Margaret M. Brown, Christopher R. Harper, Curtis Florence, Philippe J. Giabbanelli
Suicide is a leading cause of death in the United States, particularly among adolescents. In recent years, suicidal ideation, attempts, and fatalities have increased. Systems maps can effectively represent complex issues such as suicide, thus providing decision-support tools for policymakers to identify and evaluate interventions. While network science has served to examine systems maps in fields such as obesity, there is limited research at the intersection of suicidology and network science. In this paper, we apply network science to a large causal map of adverse childhood experiences (ACEs) and suicide to address this gap. The National Center for Injury Prevention and Control (NCIPC) within the Centers for Disease Control and Prevention recently created a causal map that encapsulates ACEs and adolescent suicide in 361 concept nodes and 946 directed relationships. In this study, we examine this map and three similar models through three related questions: (Q1) how do existing network-based models of suicide differ in terms of node- and network-level characteristics? (Q2) Using the NCIPC model as a unifying framework, how do current suicide intervention strategies align with prevailing theories of suicide? (Q3) How can the use of network science on the NCIPC model guide suicide interventions?
在美国,自杀是导致死亡的一个主要原因,尤其是在青少年中。近年来,自杀意念、自杀未遂和死亡人数都有所增加。系统地图可以有效地表示自杀等复杂问题,从而为决策者提供决策支持工具,以确定和评估干预措施。虽然网络科学已在肥胖症等领域用于研究系统地图,但在自杀学和网络科学的交叉领域,研究还很有限。在本文中,我们将网络科学应用于童年不良经历(ACE)与自杀的大型因果关系图,以弥补这一不足。美国疾病控制和预防中心(Centers for Disease Control and Prevention)下属的国家伤害预防和控制中心(NCIPC)最近绘制了一张因果关系图,将ACE和青少年自杀囊括在361个概念节点和946个定向关系中。在本研究中,我们通过三个相关问题对该地图和三个类似模型进行了研究:(问题1)现有的基于网络的自杀模型在节点和网络层面的特征方面有何不同?(Q2) 以 NCIPC 模型为统一框架,当前的自杀干预策略如何与流行的自杀理论保持一致?(Q3) 如何利用网络科学来指导 NCIPC 模型中的自杀干预措施?
{"title":"Guiding prevention initiatives by applying network analysis to systems maps of adverse childhood experiences and adolescent suicide","authors":"Benjamin D. Maldonado, Ryan Schuerkamp, Cassidy M. Martin, Ketra L. Rice, Nisha Nataraj, Margaret M. Brown, Christopher R. Harper, Curtis Florence, Philippe J. Giabbanelli","doi":"10.1017/nws.2024.8","DOIUrl":"https://doi.org/10.1017/nws.2024.8","url":null,"abstract":"\u0000 Suicide is a leading cause of death in the United States, particularly among adolescents. In recent years, suicidal ideation, attempts, and fatalities have increased. Systems maps can effectively represent complex issues such as suicide, thus providing decision-support tools for policymakers to identify and evaluate interventions. While network science has served to examine systems maps in fields such as obesity, there is limited research at the intersection of suicidology and network science. In this paper, we apply network science to a large causal map of adverse childhood experiences (ACEs) and suicide to address this gap. The National Center for Injury Prevention and Control (NCIPC) within the Centers for Disease Control and Prevention recently created a causal map that encapsulates ACEs and adolescent suicide in 361 concept nodes and 946 directed relationships. In this study, we examine this map and three similar models through three related questions: (Q1) how do existing network-based models of suicide differ in terms of node- and network-level characteristics? (Q2) Using the NCIPC model as a unifying framework, how do current suicide intervention strategies align with prevailing theories of suicide? (Q3) How can the use of network science on the NCIPC model guide suicide interventions?","PeriodicalId":51827,"journal":{"name":"Network Science","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141099232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
When people are asked to recall their social networks, theoretical and empirical work tells us that they rely on shortcuts, or heuristics. Cognitive social structures (CSSs) are multilayer social networks where each layer corresponds to an individual’s perception of the network. With multiple perceptions of the same network, CSSs contain rich information about how these heuristics manifest, motivating the question, Can we identify people who share the same heuristics? In this work, we propose a method for identifying cognitive structure across multiple network perceptions, analogous to how community detection aims to identify social structure in a network. To simultaneously model the joint latent social and cognitive structure, we study CSSs as three-dimensional tensors, employing low-rank nonnegative Tucker decompositions (NNTuck) to approximate the CSS—a procedure closely related to estimating a multilayer stochastic block model (SBM) from such data. We propose the resulting latent cognitive space as an operationalization of the sociological theory of social cognition by identifying individuals who share relational schema. In addition to modeling cognitively independent, dependent, and redundant networks, we propose a specific model instance and related statistical test for testing when there is social-cognitive agreement in a network: when the social and cognitive structures are equivalent. We use our approach to analyze four different CSSs and give insights into the latent cognitive structures of those networks.
{"title":"The latent cognitive structures of social networks","authors":"Izabel Aguiar, Johan Ugander","doi":"10.1017/nws.2024.7","DOIUrl":"https://doi.org/10.1017/nws.2024.7","url":null,"abstract":"When people are asked to recall their social networks, theoretical and empirical work tells us that they rely on shortcuts, or heuristics. Cognitive social structures (CSSs) are multilayer social networks where each layer corresponds to an individual’s perception of the network. With multiple perceptions of the same network, CSSs contain rich information about how these heuristics manifest, motivating the question, <jats:italic>Can we identify people who share the same heuristics?</jats:italic> In this work, we propose a method for identifying <jats:italic>cognitive structure</jats:italic> across multiple network perceptions, analogous to how community detection aims to identify <jats:italic>social structure</jats:italic> in a network. To simultaneously model the joint latent social and cognitive structure, we study CSSs as three-dimensional tensors, employing low-rank nonnegative Tucker decompositions (NNTuck) to approximate the CSS—a procedure closely related to estimating a multilayer stochastic block model (SBM) from such data. We propose the resulting latent cognitive space as an operationalization of the sociological theory of <jats:italic>social cognition</jats:italic> by identifying individuals who share <jats:italic>relational schema</jats:italic>. In addition to modeling cognitively <jats:italic>independent</jats:italic>, <jats:italic>dependent</jats:italic>, and <jats:italic>redundant</jats:italic> networks, we propose a specific model instance and related statistical test for testing when there is <jats:italic>social-cognitive agreement</jats:italic> in a network: when the social and cognitive structures are equivalent. We use our approach to analyze four different CSSs and give insights into the latent cognitive structures of those networks.","PeriodicalId":51827,"journal":{"name":"Network Science","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140804893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sebastian Buß, Hendrik Molter, Rolf Niedermeier, Maciej Rymar
The betweenness centrality of a graph vertex measures how often this vertex is visited on shortest paths between other vertices of the graph. In the analysis of many real-world graphs or networks, the betweenness centrality of a vertex is used as an indicator for its relative importance in the network. In particular, it is among the most popular tools in social network analysis. In recent years, a growing number of real-world networks have been modeled as temporal graphs instead of conventional (static) graphs. In a temporal graph, we have a fixed set of vertices and there is a finite discrete set of time steps, and every edge might be present only at some time steps. While shortest paths are straightforward to define in static graphs, temporal paths can be considered “optimal” with respect to many different criteria, including length, arrival time, and overall travel time (shortest, foremost, and fastest paths). This leads to different concepts of temporal betweenness centrality, posing new challenges on the algorithmic side. We provide a systematic study of temporal betweenness variants based on various concepts of optimal temporal paths. Computing the betweenness centrality for vertices in a graph is closely related to counting the number of optimal paths between vertex pairs. While in static graphs computing the number of shortest paths is easily doable in polynomial time, we show that counting foremost and fastest paths is computationally intractable (#P-hard), and hence, the computation of the corresponding temporal betweenness values is intractable as well. For shortest paths and two selected special cases of foremost paths, we devise polynomial-time algorithms for temporal betweenness computation. Moreover, we also explore the distinction between strict (ascending time labels) and non-strict (non-descending time labels) time labels in temporal paths. In our experiments with established real-world temporal networks, we demonstrate the practical effectiveness of our algorithms, compare the various betweenness concepts, and derive recommendations on their practical use.
{"title":"Algorithmic aspects of temporal betweenness","authors":"Sebastian Buß, Hendrik Molter, Rolf Niedermeier, Maciej Rymar","doi":"10.1017/nws.2024.5","DOIUrl":"https://doi.org/10.1017/nws.2024.5","url":null,"abstract":"The <jats:italic>betweenness centrality</jats:italic> of a graph vertex measures how often this vertex is visited on shortest paths between other vertices of the graph. In the analysis of many real-world graphs or networks, the betweenness centrality of a vertex is used as an indicator for its relative importance in the network. In particular, it is among the most popular tools in social network analysis. In recent years, a growing number of real-world networks have been modeled as <jats:italic>temporal graphs</jats:italic> instead of conventional (static) graphs. In a temporal graph, we have a fixed set of vertices and there is a finite discrete set of time steps, and every edge might be present only at some time steps. While shortest paths are straightforward to define in static graphs, temporal paths can be considered “optimal” with respect to many different criteria, including length, arrival time, and overall travel time (shortest, foremost, and fastest paths). This leads to different concepts of <jats:italic>temporal betweenness centrality</jats:italic>, posing new challenges on the algorithmic side. We provide a systematic study of temporal betweenness variants based on various concepts of optimal temporal paths. Computing the betweenness centrality for vertices in a graph is closely related to counting the number of optimal paths between vertex pairs. While in static graphs computing the number of shortest paths is easily doable in polynomial time, we show that counting foremost and fastest paths is computationally intractable (#P-hard), and hence, the computation of the corresponding temporal betweenness values is intractable as well. For shortest paths and two selected special cases of foremost paths, we devise polynomial-time algorithms for temporal betweenness computation. Moreover, we also explore the distinction between strict (ascending time labels) and non-strict (non-descending time labels) time labels in temporal paths. In our experiments with established real-world temporal networks, we demonstrate the practical effectiveness of our algorithms, compare the various betweenness concepts, and derive recommendations on their practical use.","PeriodicalId":51827,"journal":{"name":"Network Science","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140560008","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Collecting network data directly from network members can be challenging. One alternative involves inferring a network from observed groups, for example, inferring a network of scientific collaboration from researchers’ observed paper authorships. In this paper, I explore when an unobserved undirected network of interest can accurately be inferred from observed groups. The analysis uses simulations to experimentally manipulate the structure of the unobserved network to be inferred, the number of groups observed, the extent to which the observed groups correspond to cliques in the unobserved network, and the method used to draw inferences. I find that when a small number of groups are observed, an unobserved network can be accurately inferred using a simple unweighted two-mode projection, provided that each group’s membership closely corresponds to a clique in the unobserved network. In contrast, when a large number of groups are observed, an unobserved network can be accurately inferred using a statistical backbone extraction model, even if the groups’ memberships are mostly random. These findings offer guidance for researchers seeking to indirectly measure a network of interest using observations of groups.
{"title":"When can networks be inferred from observed groups?","authors":"Zachary P. Neal","doi":"10.1017/nws.2024.6","DOIUrl":"https://doi.org/10.1017/nws.2024.6","url":null,"abstract":"Collecting network data directly from network members can be challenging. One alternative involves inferring a network from observed groups, for example, inferring a network of scientific collaboration from researchers’ observed paper authorships. In this paper, I explore when an unobserved undirected network of interest can accurately be inferred from observed groups. The analysis uses simulations to experimentally manipulate the structure of the unobserved network to be inferred, the number of groups observed, the extent to which the observed groups correspond to cliques in the unobserved network, and the method used to draw inferences. I find that when a small number of groups are observed, an unobserved network can be accurately inferred using a simple unweighted two-mode projection, provided that each group’s membership closely corresponds to a clique in the unobserved network. In contrast, when a large number of groups are observed, an unobserved network can be accurately inferred using a statistical backbone extraction model, even if the groups’ memberships are mostly random. These findings offer guidance for researchers seeking to indirectly measure a network of interest using observations of groups.","PeriodicalId":51827,"journal":{"name":"Network Science","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140560011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
We introduce a novel preferential attachment model using the draw variables of a modified Pólya urn with an expanding number of colors, notably capable of modeling influential opinions (in terms of vertices of high degree) as the graph evolves. Similar to the Barabási-Albert model, the generated graph grows in size by one vertex at each time instance; in contrast however, each vertex of the graph is uniquely characterized by a color, which is represented by a ball color in the Pólya urn. More specifically at each time step, we draw a ball from the urn and return it to the urn along with a number of reinforcing balls of the same color; we also add another ball of a new color to the urn. We then construct an edge between the new vertex (corresponding to the new color) and the existing vertex whose color ball is drawn. Using color-coded vertices in conjunction with the time-varying reinforcing parameter allows for vertices added (born) later in the process to potentially attain a high degree in a way that is not captured in the Barabási-Albert model. We study the degree count of the vertices by analyzing the draw vectors of the underlying stochastic process. In particular, we establish the probability distribution of the random variable counting the number of draws of a given color which determines the degree of the vertex corresponding to that color in the graph. We further provide simulation results presenting a comparison between our model and the Barabási-Albert network.
{"title":"Generating preferential attachment graphs via a Pólya urn with expanding colors","authors":"Somya Singh, Fady Alajaji, Bahman Gharesifard","doi":"10.1017/nws.2024.3","DOIUrl":"https://doi.org/10.1017/nws.2024.3","url":null,"abstract":"We introduce a novel preferential attachment model using the draw variables of a modified Pólya urn with an expanding number of colors, notably capable of modeling influential opinions (in terms of vertices of high degree) as the graph evolves. Similar to the Barabási-Albert model, the generated graph grows in size by one vertex at each time instance; in contrast however, each vertex of the graph is uniquely characterized by a color, which is represented by a ball color in the Pólya urn. More specifically at each time step, we draw a ball from the urn and return it to the urn along with a number of reinforcing balls of the same color; we also add another ball of a new color to the urn. We then construct an edge between the new vertex (corresponding to the new color) and the existing vertex whose color ball is drawn. Using color-coded vertices in conjunction with the time-varying reinforcing parameter allows for vertices added (born) later in the process to potentially attain a high degree in a way that is not captured in the Barabási-Albert model. We study the degree count of the vertices by analyzing the draw vectors of the underlying stochastic process. In particular, we establish the probability distribution of the random variable counting the number of draws of a given color which determines the degree of the vertex corresponding to that color in the graph. We further provide simulation results presenting a comparison between our model and the Barabási-Albert network.","PeriodicalId":51827,"journal":{"name":"Network Science","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140560733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automated detection of edge clusters via an overfitted mixture prior – CORRIGENDUM","authors":"H. T. D. Pham, Daniel K. Sewell","doi":"10.1017/nws.2024.4","DOIUrl":"https://doi.org/10.1017/nws.2024.4","url":null,"abstract":"","PeriodicalId":51827,"journal":{"name":"Network Science","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140222218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Researchers theorize that many real-world networks exhibit community structure where within-community edges are more likely than between-community edges. While numerous methods exist to cluster nodes into different communities, less work has addressed this question: given some network, does it exhibit statistically meaningful community structure? We answer this question in a principled manner by framing it as a statistical hypothesis test in terms of a general and model-agnostic community structure parameter. Leveraging this parameter, we propose a simple and interpretable test statistic used to formulate two separate hypothesis testing frameworks. The first is an asymptotic test against a baseline value of the parameter while the second tests against a baseline model using bootstrap-based thresholds. We prove theoretical properties of these tests and demonstrate how the proposed method yields rich insights into real-world datasets.
{"title":"A generalized hypothesis test for community structure in networks","authors":"Eric Yanchenko, Srijan Sengupta","doi":"10.1017/nws.2024.1","DOIUrl":"https://doi.org/10.1017/nws.2024.1","url":null,"abstract":"<p>Researchers theorize that many real-world networks exhibit community structure where within-community edges are more likely than between-community edges. While numerous methods exist to cluster nodes into different communities, less work has addressed this question: given some network, does it exhibit <span>statistically meaningful</span> community structure? We answer this question in a principled manner by framing it as a statistical hypothesis test in terms of a general and model-agnostic community structure parameter. Leveraging this parameter, we propose a simple and interpretable test statistic used to formulate two separate hypothesis testing frameworks. The first is an asymptotic test against a baseline value of the parameter while the second tests against a baseline model using bootstrap-based thresholds. We prove theoretical properties of these tests and demonstrate how the proposed method yields rich insights into real-world datasets.</p>","PeriodicalId":51827,"journal":{"name":"Network Science","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140098491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Empirical articles vary considerably in how they measure child and adolescent friendship networks. This meta-analysis examines four methodological moderators of children’s and adolescents’ average outdegree centrality in friendship networks: boundary specification, operational definition of friendship, unlimited vs. fixed choice design, and roster vs. free recall design. Specifically, multi-level random effects models were conducted using 261 average outdegree centrality estimates from 71 English-language peer-reviewed articles and 55 unique datasets. There were no significant differences in average outdegree centrality for child and adolescent friendship networks bounded at the classroom, grade, and school-levels. Using a name generator focused on best/close friends yielded significantly lower average outdegree centrality estimates than using a name generator focused on friends. Fixed choice designs with under 10 nominations were associated with significantly lower estimates of average outdegree centrality while fixed choice designs with 10 or more nominations were associated with significantly higher estimates of average outdegree centrality than unlimited choice designs. Free recall designs were associated with significantly lower estimates of average outdegree centrality than roster designs. Results are discussed within the context of their implications for the future measurement of child and adolescent friendship networks.
{"title":"Methodological moderators of average outdegree centrality: A meta-analysis of child and adolescent friendship networks","authors":"Jennifer Watling Neal","doi":"10.1017/nws.2024.2","DOIUrl":"https://doi.org/10.1017/nws.2024.2","url":null,"abstract":"Empirical articles vary considerably in how they measure child and adolescent friendship networks. This meta-analysis examines four methodological moderators of children’s and adolescents’ average outdegree centrality in friendship networks: boundary specification, operational definition of friendship, unlimited vs. fixed choice design, and roster vs. free recall design. Specifically, multi-level random effects models were conducted using 261 average outdegree centrality estimates from 71 English-language peer-reviewed articles and 55 unique datasets. There were no significant differences in average outdegree centrality for child and adolescent friendship networks bounded at the classroom, grade, and school-levels. Using a name generator focused on best/close friends yielded significantly lower average outdegree centrality estimates than using a name generator focused on friends. Fixed choice designs with under 10 nominations were associated with significantly lower estimates of average outdegree centrality while fixed choice designs with 10 or more nominations were associated with significantly higher estimates of average outdegree centrality than unlimited choice designs. Free recall designs were associated with significantly lower estimates of average outdegree centrality than roster designs. Results are discussed within the context of their implications for the future measurement of child and adolescent friendship networks.","PeriodicalId":51827,"journal":{"name":"Network Science","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140075991","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Most community detection methods focus on clustering actors with common features in a network. However, clustering edges offers a more intuitive way to understand the network structure in many real-life applications. Among the existing methods for network edge clustering, the majority are algorithmic, with the exception of the latent space edge clustering (LSEC) model proposed by Sewell (Journal of Computational and Graphical Statistics, 30(2), 390–405, 2021). LSEC was shown to have good performance in simulation and real-life data analysis, but fitting this model requires prior knowledge of the number of clusters and latent dimensions, which are often unknown to researchers. Within a Bayesian framework, we propose an extension to the LSEC model using a sparse finite mixture prior that supports automated selection of the number of clusters. We refer to our proposed approach as the automated LSEC or aLSEC. We develop a variational Bayes generalized expectation-maximization approach and a Hamiltonian Monte Carlo-within Gibbs algorithm for estimation. Our simulation study showed that aLSEC reduced run time by 10 to over 100 times compared to LSEC. Like LSEC, aLSEC maintains a computational cost that grows linearly with the number of actors in a network, making it scalable to large sparse networks. We developed the R package aLSEC which implements the proposed methodology.
{"title":"Automated detection of edge clusters via an overfitted mixture prior","authors":"Hanh T. D. Pham, Daniel K. Sewell","doi":"10.1017/nws.2023.22","DOIUrl":"https://doi.org/10.1017/nws.2023.22","url":null,"abstract":"Most community detection methods focus on clustering actors with common features in a network. However, clustering edges offers a more intuitive way to understand the network structure in many real-life applications. Among the existing methods for network edge clustering, the majority are algorithmic, with the exception of the latent space edge clustering (LSEC) model proposed by Sewell (<jats:italic>Journal of Computational and Graphical Statistics, 30</jats:italic>(2), 390–405, 2021). LSEC was shown to have good performance in simulation and real-life data analysis, but fitting this model requires prior knowledge of the number of clusters and latent dimensions, which are often unknown to researchers. Within a Bayesian framework, we propose an extension to the LSEC model using a sparse finite mixture prior that supports automated selection of the number of clusters. We refer to our proposed approach as the automated LSEC or aLSEC. We develop a variational Bayes generalized expectation-maximization approach and a Hamiltonian Monte Carlo-within Gibbs algorithm for estimation. Our simulation study showed that aLSEC reduced run time by 10 to over 100 times compared to LSEC. Like LSEC, aLSEC maintains a computational cost that grows linearly with the number of actors in a network, making it scalable to large sparse networks. We developed the R package aLSEC which implements the proposed methodology.","PeriodicalId":51827,"journal":{"name":"Network Science","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139516558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Viral marketing campaigns target primarily those individuals who are central in social networks and hence have social influence. Marketing events, however, may attract diverse audience. Despite the importance of event marketing, the influence of heterogeneous target groups is not well understood yet. In this paper, we define the Audience Selection (AS) problem in which different sets of agents need to be evaluated and compared based on their social influence. A typical application of Audience selection is choosing locations for a series of marketing events. The Audience selection problem is different from the well-known Influence Maximization (IM) problem in two aspects. Firstly, it deals with sets rather than nodes. Secondly, the sets are diverse, composed by a mixture of influential and ordinary agents. Thus, Audience selection needs to assess the contribution of ordinary agents too, while IM only aims to find top spreaders. We provide a systemic test for ranking influence measures in the Audience Selection problem based on node sampling and on a novel statistical method, the Sum of Ranking Differences. Using a Linear Threshold diffusion model on two online social networks, we evaluate eight network measures of social influence. We demonstrate that the statistical assessment of these influence measures is remarkably different in the Audience Selection problem, when low-ranked individuals are present, from the IM problem, when we focus on the algorithm’s top choices exclusively.
病毒式营销活动的主要目标群体是社交网络中的核心人物,因此具有社会影响力。然而,营销活动可能会吸引不同的受众。尽管事件营销非常重要,但人们对异质目标群体的影响还不甚了解。在本文中,我们定义了 "受众选择"(Audience Selection,AS)问题,在这个问题中,需要根据不同代理的社会影响力对其进行评估和比较。受众选择的一个典型应用是为一系列营销活动选择地点。受众选择问题与著名的影响力最大化(IM)问题有两点不同。首先,它处理的是集合而不是节点。其次,集合是多样化的,由有影响力的代理和普通代理混合组成。因此,受众选择也需要评估普通代理的贡献,而 IM 的目的只是找到顶级传播者。我们基于节点抽样和一种新颖的统计方法--排名差异总和,为受众选择问题中的排名影响度量提供了一个系统测试。我们在两个在线社交网络上使用线性阈值扩散模型,评估了八种社会影响力网络测量方法。我们证明,在受众选择问题中,当存在低排名个体时,这些影响度量的统计评估与在即时通讯问题中,当我们只关注算法的首选时,这些影响度量的统计评估明显不同。
{"title":"Audience selection for maximizing social influence","authors":"Balázs R. Sziklai, Balázs Lengyel","doi":"10.1017/nws.2023.23","DOIUrl":"https://doi.org/10.1017/nws.2023.23","url":null,"abstract":"Viral marketing campaigns target primarily those individuals who are central in social networks and hence have social influence. Marketing events, however, may attract diverse audience. Despite the importance of event marketing, the influence of heterogeneous target groups is not well understood yet. In this paper, we define the Audience Selection (AS) problem in which different sets of agents need to be evaluated and compared based on their social influence. A typical application of Audience selection is choosing locations for a series of marketing events. The Audience selection problem is different from the well-known Influence Maximization (IM) problem in two aspects. Firstly, it deals with sets rather than nodes. Secondly, the sets are diverse, composed by a mixture of influential and ordinary agents. Thus, Audience selection needs to assess the contribution of ordinary agents too, while IM only aims to find top spreaders. We provide a systemic test for ranking influence measures in the Audience Selection problem based on node sampling and on a novel statistical method, the Sum of Ranking Differences. Using a Linear Threshold diffusion model on two online social networks, we evaluate eight network measures of social influence. We demonstrate that the statistical assessment of these influence measures is remarkably different in the Audience Selection problem, when low-ranked individuals are present, from the IM problem, when we focus on the algorithm’s top choices exclusively.","PeriodicalId":51827,"journal":{"name":"Network Science","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139465242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}