首页 > 最新文献

Applied Network Science最新文献

英文 中文
Operationalizing anthropological theory: four techniques to simplify networks of co-occurring ethnographic codes. 操作人类学理论:四种简化共同存在的民族志代码网络的技术。
IF 2.2 Q3 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2023-01-01 Epub Date: 2023-05-05 DOI: 10.1007/s41109-023-00547-6
Alberto Cottica, Veronica Davidov, Magdalena Góralska, Jan Kubik, Guy Melançon, Richard Mole, Bruno Pinaud, Wojciech Szymański

The use of data and algorithms in the social sciences allows for exciting progress, but also poses epistemological challenges. Operations that appear innocent and purely technical may profoundly influence final results. Researchers working with data can make their process less arbitrary and more accountable by making theoretically grounded methodological choices. We apply this approach to the problem of simplifying networks representing ethnographic corpora, in the interest of visual interpretation. Network nodes represent ethnographic codes, and their edges the co-occurrence of codes in a corpus. We introduce and discuss four techniques to simplify such networks and facilitate visual analysis. We show how the mathematical characteristics of each one are aligned with an identifiable approach in sociology or anthropology: structuralism and post-structuralism; identifying the central concepts in a discourse; and discovering hegemonic and counter-hegemonic clusters of meaning. We then provide an example of how the four techniques complement each other in ethnographic analysis.

在社会科学中使用数据和算法可以取得令人兴奋的进展,但也带来了认识论的挑战。看似无害和纯粹技术性的操作可能会深刻影响最终结果。研究数据的研究人员可以通过做出基于理论的方法选择,使他们的过程不那么武断,更负责任。为了视觉解释的利益,我们将这种方法应用于简化代表民族志语料库的网络的问题。网络节点代表民族志代码,其边缘代表语料库中代码的共现。我们介绍并讨论了四种简化此类网络和促进可视化分析的技术。我们展示了每一种数学特征是如何与社会学或人类学中可识别的方法相一致的:结构主义和后结构主义;识别话语中的中心概念;发现霸权和反霸权的意义集群。然后,我们提供了一个例子,说明这四种技术在民族志分析中是如何相辅相成的。
{"title":"Operationalizing anthropological theory: four techniques to simplify networks of co-occurring ethnographic codes.","authors":"Alberto Cottica,&nbsp;Veronica Davidov,&nbsp;Magdalena Góralska,&nbsp;Jan Kubik,&nbsp;Guy Melançon,&nbsp;Richard Mole,&nbsp;Bruno Pinaud,&nbsp;Wojciech Szymański","doi":"10.1007/s41109-023-00547-6","DOIUrl":"10.1007/s41109-023-00547-6","url":null,"abstract":"<p><p>The use of data and algorithms in the social sciences allows for exciting progress, but also poses epistemological challenges. Operations that appear innocent and purely technical may profoundly influence final results. Researchers working with data can make their process less arbitrary and more accountable by making theoretically grounded methodological choices. We apply this approach to the problem of simplifying networks representing ethnographic corpora, in the interest of visual interpretation. Network nodes represent ethnographic codes, and their edges the co-occurrence of codes in a corpus. We introduce and discuss four techniques to simplify such networks and facilitate visual analysis. We show how the mathematical characteristics of each one are aligned with an identifiable approach in sociology or anthropology: structuralism and post-structuralism; identifying the central concepts in a discourse; and discovering hegemonic and counter-hegemonic clusters of meaning. We then provide an example of how the four techniques complement each other in ethnographic analysis.</p>","PeriodicalId":37010,"journal":{"name":"Applied Network Science","volume":"8 1","pages":"22"},"PeriodicalIF":2.2,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10161994/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9857388","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Source identification via contact tracing in the presence of asymptomatic patients. 在无症状患者存在的情况下通过接触者追踪进行源头识别。
IF 2.2 Q3 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2023-01-01 Epub Date: 2023-08-21 DOI: 10.1007/s41109-023-00566-3
Gergely Ódor, Jana Vuckovic, Miguel-Angel Sanchez Ndoye, Patrick Thiran

Inferring the source of a diffusion in a large network of agents is a difficult but feasible task, if a few agents act as sensors revealing the time at which they got hit by the diffusion. One of the main limitations of current source identification algorithms is that they assume full knowledge of the contact network, which is rarely the case, especially for epidemics, where the source is called patient zero. Inspired by recent implementations of contact tracing algorithms, we propose a new framework, which we call Source Identification via Contact Tracing Framework (SICTF). In the SICTF, the source identification task starts at the time of the first hospitalization, and initially we have no knowledge about the contact network other than the identity of the first hospitalized agent. We may then explore the network by contact queries, and obtain symptom onset times by test queries in an adaptive way, i.e., both contact and test queries can depend on the outcome of previous queries. We also assume that some of the agents may be asymptomatic, and therefore cannot reveal their symptom onset time. Our goal is to find patient zero with as few contact and test queries as possible. We implement two local search algorithms for the SICTF: the LS algorithm, which has recently been proposed by Waniek et al. in a similar framework, is more data-efficient, but can fail to find the true source if many asymptomatic agents are present, whereas the LS+ algorithm is more robust to asymptomatic agents. By simulations we show that both LS and LS+ outperform previously proposed adaptive and non-adaptive source identification algorithms adapted to the SICTF, even though these baseline algorithms have full access to the contact network. Extending the theory of random exponential trees, we analytically approximate the source identification probability of the LS/ LS+ algorithms, and we show that our analytic results match the simulations. Finally, we benchmark our algorithms on the Data-driven COVID-19 Simulator (DCS) developed by Lorch et al., which is the first time source identification algorithms are tested on such a complex dataset.

如果几个代理充当传感器,揭示他们被扩散击中的时间,那么在大型代理网络中推断扩散的来源是一项困难但可行的任务。目前的源头识别算法的主要局限性之一是,它们假设对接触网络有充分的了解,而这种情况很少发生,尤其是对于流行病,源头被称为零号病人。受最近接触追踪算法实现的启发,我们提出了一个新的框架,我们称之为通过接触追踪框架进行源识别(SICTF)。在SICTF中,源识别任务从第一次住院时开始,最初我们除了第一个住院代理人的身份之外,对联系网络一无所知。然后,我们可以通过接触查询来探索网络,并以自适应的方式通过测试查询来获得症状发作时间,即接触和测试查询都可以取决于先前查询的结果。我们还假设一些药剂可能没有症状,因此无法透露其症状发作时间。我们的目标是通过尽可能少的联系和测试查询找到零号患者。我们为SICTF实现了两种局部搜索算法:最近由Waniek等人提出的LS算法。在类似的框架中,它更具数据效率,但如果存在许多无症状代理,则可能无法找到真正的源,而LS+算法对无症状代理更具鲁棒性。通过仿真,我们表明LS和LS+都优于先前提出的适用于SICTF的自适应和非自适应源识别算法,即使这些基线算法可以完全访问接触网络。在扩展随机指数树理论的基础上,我们对LS/LS+算法的源识别概率进行了解析近似,并表明我们的解析结果与仿真结果相匹配。最后,我们在Lorch等人开发的数据驱动新冠肺炎模拟器(DCS)上对我们的算法进行了基准测试,这是首次在如此复杂的数据集上测试源识别算法。
{"title":"Source identification via contact tracing in the presence of asymptomatic patients.","authors":"Gergely Ódor, Jana Vuckovic, Miguel-Angel Sanchez Ndoye, Patrick Thiran","doi":"10.1007/s41109-023-00566-3","DOIUrl":"10.1007/s41109-023-00566-3","url":null,"abstract":"<p><p>Inferring the source of a diffusion in a large network of agents is a difficult but feasible task, if a few agents act as sensors revealing the time at which they got hit by the diffusion. One of the main limitations of current source identification algorithms is that they assume full knowledge of the contact network, which is rarely the case, especially for epidemics, where the source is called patient zero. Inspired by recent implementations of contact tracing algorithms, we propose a new framework, which we call Source Identification via Contact Tracing Framework (SICTF). In the SICTF, the source identification task starts at the time of the first hospitalization, and initially we have no knowledge about the contact network other than the identity of the first hospitalized agent. We may then explore the network by contact queries, and obtain symptom onset times by test queries in an adaptive way, i.e., both contact and test queries can depend on the outcome of previous queries. We also assume that some of the agents may be asymptomatic, and therefore cannot reveal their symptom onset time. Our goal is to find patient zero with as few contact and test queries as possible. We implement two local search algorithms for the SICTF: the LS algorithm, which has recently been proposed by Waniek et al. in a similar framework, is more data-efficient, but can fail to find the true source if many asymptomatic agents are present, whereas the LS+ algorithm is more robust to asymptomatic agents. By simulations we show that both LS and LS+ outperform previously proposed adaptive and non-adaptive source identification algorithms adapted to the SICTF, even though these baseline algorithms have full access to the contact network. Extending the theory of random exponential trees, we analytically approximate the source identification probability of the LS/ LS+ algorithms, and we show that our analytic results match the simulations. Finally, we benchmark our algorithms on the Data-driven COVID-19 Simulator (DCS) developed by Lorch et al., which is the first time source identification algorithms are tested on such a complex dataset.</p>","PeriodicalId":37010,"journal":{"name":"Applied Network Science","volume":"8 1","pages":"53"},"PeriodicalIF":2.2,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10442312/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10442074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The role of luck in the success of social media influencers. 运气在社交媒体影响者成功中的作用。
IF 2.2 Q3 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2023-01-01 Epub Date: 2023-07-25 DOI: 10.1007/s41109-023-00573-4
Stefania Ionescu, Anikó Hannák, Nicolò Pagan

Motivation: Social media platforms centered around content creators (CCs) faced rapid growth in the past decade. Currently, millions of CCs make livable incomes through platforms such as YouTube, TikTok, and Instagram. As such, similarly to the job market, it is important to ensure the success and income (usually related to the follower counts) of CCs reflect the quality of their work. Since quality cannot be observed directly, two other factors govern the network-formation process: (a) the visibility of CCs (resulted from, e.g., recommender systems and moderation processes) and (b) the decision-making process of seekers (i.e., of users focused on finding CCs). Prior virtual experiments and empirical work seem contradictory regarding fairness: While the first suggests the expected number of followers of CCs reflects their quality, the second says that quality does not perfectly predict success.

Results: Our paper extends prior models in order to bridge this gap between theoretical and empirical work. We (a) define a parameterized recommendation process which allocates visibility based on popularity biases, (b) define two metrics of individual fairness (ex-ante and ex-post), and (c) define a metric for seeker satisfaction. Through an analytical approach we show our process is an absorbing Markov Chain where exploring only the most popular CCs leads to lower expected times to absorption but higher chances of unfairness for CCs. While increasing the exploration helps, doing so only guarantees fair outcomes for the highest (and lowest) quality CC. Simulations revealed that CCs and seekers prefer different algorithmic designs: CCs generally have higher chances of fairness with anti-popularity biased recommendation processes, while seekers are more satisfied with popularity-biased recommendations. Altogether, our results suggest that while the exploration of low-popularity CCs is needed to improve fairness, platforms might not have the incentive to do so and such interventions do not entirely prevent unfair outcomes.

动机:以内容创作者为中心的社交媒体平台在过去十年中面临着快速增长。目前,数以百万计的CC通过YouTube、TikTok和Instagram等平台赚取了宜居的收入。因此,与就业市场类似,确保CC的成功和收入(通常与追随者数量有关)反映其工作质量是很重要的。由于无法直接观察质量,另外两个因素控制着网络形成过程:(a)CC的可见性(例如,由推荐系统和审核过程产生)和(b)寻求者的决策过程(即,专注于寻找CC的用户)。先前的虚拟实验和实证工作在公平性方面似乎是矛盾的:虽然第一个实验表明CC的预期追随者数量反映了他们的质量,但第二个实验认为质量并不能完全预测成功。结果:我们的论文扩展了先前的模型,以弥合理论和实证工作之间的差距。我们(a)定义了一个参数化的推荐过程,该过程基于流行度偏差来分配可见性,(b)定义了两个个人公平性指标(事前和事后),以及(c)定义了寻求者满意度指标。通过分析方法,我们表明我们的过程是一个吸收马尔可夫链,其中只探索最流行的CC会导致较低的预期吸收时间,但CC不公平的几率较高。虽然增加探索有帮助,但这样做只能保证最高(和最低)质量CC的公平结果。模拟显示,CC和寻求者更喜欢不同的算法设计:CC通常在反流行偏见的推荐过程中有更高的公平机会,而寻求者对流行偏见的建议更满意。总之,我们的研究结果表明,虽然需要探索低人气CC来提高公平性,但平台可能没有这样做的动机,而且这种干预措施并不能完全防止不公平的结果。
{"title":"The role of luck in the success of social media influencers.","authors":"Stefania Ionescu,&nbsp;Anikó Hannák,&nbsp;Nicolò Pagan","doi":"10.1007/s41109-023-00573-4","DOIUrl":"10.1007/s41109-023-00573-4","url":null,"abstract":"<p><strong>Motivation: </strong>Social media platforms centered around content creators (CCs) faced rapid growth in the past decade. Currently, millions of CCs make livable incomes through platforms such as YouTube, TikTok, and Instagram. As such, similarly to the job market, it is important to ensure the success and income (usually related to the follower counts) of CCs reflect the quality of their work. Since quality cannot be observed directly, two other factors govern the network-formation process: (a) the <i>visibility</i> of CCs (resulted from, e.g., recommender systems and moderation processes) and (b) the <i>decision-making process</i> of seekers (i.e., of users focused on finding CCs). Prior virtual experiments and empirical work seem contradictory regarding fairness: While the first suggests the expected number of followers of CCs reflects their quality, the second says that quality does not perfectly predict success.</p><p><strong>Results: </strong>Our paper extends prior models in order to bridge this gap between theoretical and empirical work. We (a) define a parameterized recommendation process which allocates visibility based on popularity biases, (b) define two metrics of individual fairness (ex-ante and ex-post), and (c) define a metric for seeker satisfaction. Through an analytical approach we show our process is an absorbing Markov Chain where exploring only the most popular CCs leads to lower expected times to absorption but higher chances of unfairness for CCs. While increasing the exploration helps, doing so only guarantees fair outcomes for the highest (and lowest) quality CC. Simulations revealed that CCs and seekers prefer different algorithmic designs: CCs generally have higher chances of fairness with anti-popularity biased recommendation processes, while seekers are more satisfied with popularity-biased recommendations. Altogether, our results suggest that while the exploration of low-popularity CCs is needed to improve fairness, platforms might not have the incentive to do so and such interventions do not entirely prevent unfair outcomes.</p>","PeriodicalId":37010,"journal":{"name":"Applied Network Science","volume":"8 1","pages":"46"},"PeriodicalIF":2.2,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368581/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9887900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GRANDPA: GeneRAtive network sampling using degree and property augmentation applied to the analysis of partially confidential healthcare networks. GRANDPA:使用度和属性增强的基因相关网络采样,应用于部分保密医疗网络的分析。
IF 1.3 Q3 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2023-01-01 Epub Date: 2023-05-11 DOI: 10.1007/s41109-023-00548-5
Carly A Bobak, Yifan Zhao, Joshua J Levy, A James O'Malley

Protecting medical privacy can create obstacles in the analysis and distribution of healthcare graphs and statistical inferences accompanying them. We pose a graph simulation model which generates networks using degree and property augmentation and provide a flexible R package that allows users to create graphs that preserve vertex attribute relationships and approximating the retention of topological properties observed in the original graph (e.g., community structure). We illustrate our proposed algorithm using a case study based on Zachary's karate network and a patient-sharing graph generated from Medicare claims data in 2019. In both cases, we find that community structure is preserved, and normalized root mean square error between cumulative distributions of the degrees across the generated and the original graphs is low (0.0508 and 0.0514 respectively).

保护医疗隐私会给医疗图表的分析和发布以及随之而来的统计推断造成障碍。我们提出了一个图仿真模型,该模型利用度和属性增强生成网络,并提供了一个灵活的 R 软件包,允许用户创建保留顶点属性关系的图,并近似保留原始图中观察到的拓扑属性(如群落结构)。我们使用基于 Zachary 空手道网络的案例研究和根据 2019 年医疗保险报销数据生成的患者共享图来说明我们提出的算法。在这两种情况下,我们都发现群落结构得到了保留,生成图和原始图的度数累积分布之间的归一化均方根误差很低(分别为 0.0508 和 0.0514)。
{"title":"GRANDPA: GeneRAtive network sampling using degree and property augmentation applied to the analysis of partially confidential healthcare networks.","authors":"Carly A Bobak, Yifan Zhao, Joshua J Levy, A James O'Malley","doi":"10.1007/s41109-023-00548-5","DOIUrl":"10.1007/s41109-023-00548-5","url":null,"abstract":"<p><p>Protecting medical privacy can create obstacles in the analysis and distribution of healthcare graphs and statistical inferences accompanying them. We pose a graph simulation model which generates networks using degree and property augmentation and provide a flexible R package that allows users to create graphs that preserve vertex attribute relationships and approximating the retention of topological properties observed in the original graph (e.g., community structure). We illustrate our proposed algorithm using a case study based on Zachary's karate network and a patient-sharing graph generated from Medicare claims data in 2019. In both cases, we find that community structure is preserved, and normalized root mean square error between cumulative distributions of the degrees across the generated and the original graphs is low (0.0508 and 0.0514 respectively).</p>","PeriodicalId":37010,"journal":{"name":"Applied Network Science","volume":"8 1","pages":"23"},"PeriodicalIF":1.3,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10173245/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10115610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nudging cooperation among agents in an experimental social network. 实验性社会网络中代理人之间的助推合作。
IF 2.2 Q3 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2023-01-01 Epub Date: 2023-09-12 DOI: 10.1007/s41109-023-00588-x
Gorm Gruner Jensen, Martin Benedikt Busch, Marco Piovesan, Jan O Haerter

We investigate the development of cooperative behavior in networks over time. In our controlled laboratory experiment, subjects can cooperate by sending costly messages that contain valuable information for the receiver or other subjects in the network. Any message sent can increase the chance that subjects find the information they are looking for and consequently their profit. We find that cooperation emerges spontaneously and remains stable over time. In an additional treatment, we provide a non-binding suggestion about who to contact at the beginning of the experiment. We find that subjects partially follow our recommendation, and this increases their own and others' profit. Despite the removal of suggestions, subjects build long-lasting relationships with the suggested contacts.

Supplementary information: The online version contains supplementary material available at 10.1007/s41109-023-00588-x.

我们研究了合作行为在网络中的发展。在我们的控制实验室实验中,受试者可以通过发送昂贵的信息来合作,这些信息包含对接收者或网络中的其他受试者有价值的信息。发送的任何信息都可以增加交易对象找到他们想要的信息的机会,从而增加他们的利润。我们发现,合作是自发产生的,并随着时间的推移保持稳定。在另一项处理中,我们提供了一个关于在实验开始时联系谁的非约束性建议。我们发现被试者部分遵循我们的建议,这增加了他们自己和他人的利益。尽管删除了建议,受试者还是与建议的联系人建立了持久的关系。补充信息:在线版本包含补充资料,提供地址为10.1007/s41109-023-00588-x。
{"title":"Nudging cooperation among agents in an experimental social network.","authors":"Gorm Gruner Jensen, Martin Benedikt Busch, Marco Piovesan, Jan O Haerter","doi":"10.1007/s41109-023-00588-x","DOIUrl":"10.1007/s41109-023-00588-x","url":null,"abstract":"<p><p>We investigate the development of cooperative behavior in networks over time. In our controlled laboratory experiment, subjects can cooperate by sending costly messages that contain valuable information for the receiver or other subjects in the network. Any message sent can increase the chance that subjects find the information they are looking for and consequently their profit. We find that cooperation emerges spontaneously and remains stable over time. In an additional treatment, we provide a non-binding suggestion about who to contact at the beginning of the experiment. We find that subjects partially follow our recommendation, and this increases their own and others' profit. Despite the removal of suggestions, subjects build long-lasting relationships with the suggested contacts.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s41109-023-00588-x.</p>","PeriodicalId":37010,"journal":{"name":"Applied Network Science","volume":"8 1","pages":"62"},"PeriodicalIF":2.2,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10497665/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10269051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parameterizing network graph heterogeneity using a modified Weibull distribution. 使用改进的威布尔分布参数化网络图的异质性。
IF 2.2 Q3 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2023-01-01 DOI: 10.1007/s41109-023-00544-9
Sinan A Ozbay, Maximilian M Nguyen

We present a simple method to quantitatively capture the heterogeneity in the degree distribution of a network graph using a single parameter σ . Using an exponential transformation of the shape parameter of the Weibull distribution, this control parameter allows the degree distribution to be easily interpolated between highly symmetric and highly heterogeneous distributions on the unit interval. This parameterization of heterogeneity also recovers several other canonical distributions as intermediate special cases, including the Gaussian, Rayleigh, and exponential distributions. We then outline a general graph generation algorithm to produce graphs with a desired amount of heterogeneity. The utility of this formulation of a heterogeneity parameter is demonstrated with examples relating to epidemiological modeling and spectral analysis.

我们提出了一种简单的方法,用单个参数σ来定量地捕捉网络图度分布的异质性。利用威布尔分布形状参数的指数变换,该控制参数允许在单位区间上的高度对称分布和高度非均匀分布之间容易地插值度分布。异质性的参数化也恢复了其他几种典型分布作为中间的特殊情况,包括高斯分布、瑞利分布和指数分布。然后,我们概述了一种通用的图形生成算法,以生成具有所需异质性的图形。通过与流行病学建模和光谱分析有关的例子,证明了这种异质性参数公式的实用性。
{"title":"Parameterizing network graph heterogeneity using a modified Weibull distribution.","authors":"Sinan A Ozbay,&nbsp;Maximilian M Nguyen","doi":"10.1007/s41109-023-00544-9","DOIUrl":"https://doi.org/10.1007/s41109-023-00544-9","url":null,"abstract":"<p><p>We present a simple method to quantitatively capture the heterogeneity in the degree distribution of a network graph using a single parameter <math><mi>σ</mi></math> . Using an exponential transformation of the shape parameter of the Weibull distribution, this control parameter allows the degree distribution to be easily interpolated between highly symmetric and highly heterogeneous distributions on the unit interval. This parameterization of heterogeneity also recovers several other canonical distributions as intermediate special cases, including the Gaussian, Rayleigh, and exponential distributions. We then outline a general graph generation algorithm to produce graphs with a desired amount of heterogeneity. The utility of this formulation of a heterogeneity parameter is demonstrated with examples relating to epidemiological modeling and spectral analysis.</p>","PeriodicalId":37010,"journal":{"name":"Applied Network Science","volume":"8 1","pages":"20"},"PeriodicalIF":2.2,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10144902/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9784643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Impact of network centrality and income on slowing infection spread after outbreaks. 网络中心性和收入对疫情后减缓感染传播的影响。
IF 2.2 Q3 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2023-01-01 DOI: 10.1007/s41109-023-00540-z
Shiv G Yücel, Rafael H M Pereira, Pedro S Peixoto, Chico Q Camargo

The COVID-19 pandemic has shed light on how the spread of infectious diseases worldwide are importantly shaped by both human mobility networks and socio-economic factors. However, few studies look at how both socio-economic conditions and the complex network properties of human mobility patterns interact, and how they influence outbreaks together. We introduce a novel methodology, called the Infection Delay Model, to calculate how the arrival time of an infection varies geographically, considering both effective distance-based metrics and differences in regions' capacity to isolate-a feature associated with socio-economic inequalities. To illustrate an application of the Infection Delay Model, this paper integrates household travel survey data with cell phone mobility data from the São Paulo metropolitan region to assess the effectiveness of lockdowns to slow the spread of COVID-19. Rather than operating under the assumption that the next pandemic will begin in the same region as the last, the model estimates infection delays under every possible outbreak scenario, allowing for generalizable insights into the effectiveness of interventions to delay a region's first case. The model sheds light on how the effectiveness of lockdowns to slow the spread of disease is influenced by the interaction of mobility networks and socio-economic levels. We find that a negative relationship emerges between network centrality and the infection delay after a lockdown, irrespective of income. Furthermore, for regions across all income and centrality levels, outbreaks starting in less central locations were more effectively slowed by a lockdown. Using the Infection Delay Model, this paper identifies and quantifies a new dimension of disease risk faced by those most central in a mobility network.

2019冠状病毒病大流行揭示了人类流动网络和社会经济因素如何在很大程度上影响全球传染病的传播。然而,很少有研究着眼于社会经济条件和人类流动模式的复杂网络特性如何相互作用,以及它们如何共同影响疫情。我们引入了一种新的方法,称为感染延迟模型,来计算感染的到达时间在地理上是如何变化的,同时考虑了基于距离的有效度量和区域隔离能力的差异——这是与社会经济不平等相关的特征。为了说明感染延迟模型的应用,本文将家庭旅行调查数据与来自圣保罗大都会地区的手机移动数据相结合,以评估封锁对减缓COVID-19传播的有效性。该模型不是假设下一次大流行将在上一次大流行的同一地区开始,而是在每一种可能的爆发情景下估计感染延迟,从而可以对延迟一个地区第一例病例的干预措施的有效性进行概括性的了解。该模型揭示了封锁减缓疾病传播的有效性如何受到流动网络和社会经济水平相互作用的影响。我们发现,与收入无关,封锁后的网络中心性与感染延迟之间存在负相关关系。此外,对于所有收入和中心性水平的地区,从中心位置较低的地区开始的疫情通过封锁得到了更有效的减缓。利用感染延迟模型,本文确定并量化了移动网络中最核心人群所面临的疾病风险的新维度。
{"title":"Impact of network centrality and income on slowing infection spread after outbreaks.","authors":"Shiv G Yücel,&nbsp;Rafael H M Pereira,&nbsp;Pedro S Peixoto,&nbsp;Chico Q Camargo","doi":"10.1007/s41109-023-00540-z","DOIUrl":"https://doi.org/10.1007/s41109-023-00540-z","url":null,"abstract":"<p><p>The COVID-19 pandemic has shed light on how the spread of infectious diseases worldwide are importantly shaped by both human mobility networks and socio-economic factors. However, few studies look at how both socio-economic conditions and the complex network properties of human mobility patterns interact, and how they influence outbreaks together. We introduce a novel methodology, called the Infection Delay Model, to calculate how the arrival time of an infection varies geographically, considering both effective distance-based metrics and differences in regions' capacity to isolate-a feature associated with socio-economic inequalities. To illustrate an application of the Infection Delay Model, this paper integrates household travel survey data with cell phone mobility data from the São Paulo metropolitan region to assess the effectiveness of lockdowns to slow the spread of COVID-19. Rather than operating under the assumption that the next pandemic will begin in the same region as the last, the model estimates infection delays under every possible outbreak scenario, allowing for generalizable insights into the effectiveness of interventions to delay a region's first case. The model sheds light on how the effectiveness of lockdowns to slow the spread of disease is influenced by the interaction of mobility networks and socio-economic levels. We find that a negative relationship emerges between network centrality and the infection delay after a lockdown, irrespective of income. Furthermore, for regions across all income and centrality levels, outbreaks starting in less central locations were more effectively slowed by a lockdown. Using the Infection Delay Model, this paper identifies and quantifies a new dimension of disease risk faced by those most central in a mobility network.</p>","PeriodicalId":37010,"journal":{"name":"Applied Network Science","volume":"8 1","pages":"16"},"PeriodicalIF":2.2,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9951146/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10872656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Network embedding aided vaccine skepticism detection. 网络嵌入辅助疫苗怀疑性检测。
IF 1.3 Q3 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2023-01-01 Epub Date: 2023-02-16 DOI: 10.1007/s41109-023-00534-x
Ferenc Béres, Tamás Vilmos Michaletzky, Rita Csoma, András A Benczúr

We investigate automatic methods to assess COVID vaccination views in Twitter content. Vaccine skepticism has been a controversial topic of long history that has become more important than ever with the COVID-19 pandemic. Our main goal is to demonstrate the importance of network effects in detecting vaccination skeptic content. Towards this end, we collected and manually labeled vaccination-related Twitter content in the first half of 2021. Our experiments confirm that the network carries information that can be exploited to improve the accuracy of classifying attitudes towards vaccination over content classification as baseline. We evaluate a variety of network embedding algorithms, which we combine with text embedding to obtain classifiers for vaccination skeptic content. In our experiments, by using Walklets, we improve the AUC of the best classifier with no network information by. We publicly release our labels, Tweet IDs and source codes on GitHub.

我们研究了自动评估 Twitter 内容中 COVID 疫苗接种观点的方法。疫苗怀疑论是一个历史悠久的争议性话题,随着 COVID-19 的流行,它变得比以往任何时候都更加重要。我们的主要目标是证明网络效应在检测疫苗接种怀疑论内容方面的重要性。为此,我们收集了 2021 年上半年与疫苗接种相关的 Twitter 内容,并对其进行了人工标注。我们的实验证实,与内容分类相比,网络携带的信息可用于提高疫苗接种态度分类的准确性。我们评估了各种网络嵌入算法,并将其与文本嵌入相结合,从而获得疫苗接种怀疑论内容的分类器。在我们的实验中,通过使用 Walklets,我们提高了无网络信息的最佳分类器的 AUC。我们在 GitHub 上公开发布我们的标签、Tweet ID 和源代码。
{"title":"Network embedding aided vaccine skepticism detection.","authors":"Ferenc Béres, Tamás Vilmos Michaletzky, Rita Csoma, András A Benczúr","doi":"10.1007/s41109-023-00534-x","DOIUrl":"10.1007/s41109-023-00534-x","url":null,"abstract":"<p><p>We investigate automatic methods to assess COVID vaccination views in Twitter content. Vaccine skepticism has been a controversial topic of long history that has become more important than ever with the COVID-19 pandemic. Our main goal is to demonstrate the importance of network effects in detecting vaccination skeptic content. Towards this end, we collected and manually labeled vaccination-related Twitter content in the first half of 2021. Our experiments confirm that the network carries information that can be exploited to improve the accuracy of classifying attitudes towards vaccination over content classification as baseline. We evaluate a variety of network embedding algorithms, which we combine with text embedding to obtain classifiers for vaccination skeptic content. In our experiments, by using Walklets, we improve the AUC of the best classifier with no network information by. We publicly release our labels, Tweet IDs and source codes on GitHub.</p>","PeriodicalId":37010,"journal":{"name":"Applied Network Science","volume":"8 1","pages":"11"},"PeriodicalIF":1.3,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9933796/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10765431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The persistent homology of genealogical networks. 宗谱网络的持久同源性。
IF 2.2 Q3 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2023-01-01 DOI: 10.1007/s41109-023-00538-7
Zachary M Boyd, Nick Callor, Taylor Gledhill, Abigail Jenkins, Robert Snellman, Benjamin Webb, Raelynn Wonnacott

Genealogical networks (i.e. family trees) are of growing interest, with the largest known data sets now including well over one billion individuals. Interest in family history also supports an 8.5 billion dollar industry whose size is projected to double within 7 years [FutureWise report HC-1137]. Yet little mathematical attention has been paid to the complex network properties of genealogical networks, especially at large scales. The structure of genealogical networks is of particular interest due to the practice of forming unions, e.g. marriages, that are typically well outside one's immediate family. In most other networks, including other social networks, no equivalent restriction exists on the distance at which relationships form. To study the effect this has on genealogical networks we use persistent homology to identify and compare the structure of 101 genealogical and 31 other social networks. Specifically, we introduce the notion of a network's persistence curve, which encodes the network's set of persistence intervals. We find that the persistence curves of genealogical networks have a distinct structure when compared to other social networks. This difference in structure also extends to subnetworks of genealogical and social networks suggesting that, even with incomplete data, persistent homology can be used to meaningfully analyze genealogical networks. Here we also describe how concepts from genealogical networks, such as common ancestor cycles, are represented using persistent homology. We expect that persistent homology tools will become increasingly important in genealogical exploration as popular interest in ancestry research continues to expand.

家谱网络(即家谱)日益引起人们的兴趣,目前已知的最大数据集包括远远超过10亿个人。对家族史的兴趣也支持了一个85亿美元的产业,其规模预计将在7年内翻一番[FutureWise报告HC-1137]。然而,很少有数学关注谱系网络的复杂网络特性,特别是在大尺度上。家谱网络的结构特别令人感兴趣,因为形成联盟的实践,例如婚姻,通常是在一个人的直系亲属之外。在大多数其他网络中,包括其他社交网络,对关系形成的距离没有相应的限制。为了研究这对系谱网络的影响,我们使用持久同源性来识别和比较101个系谱网络和31个其他社会网络的结构。具体来说,我们引入了网络持久曲线的概念,它对网络的持久间隔集进行编码。我们发现,与其他社会网络相比,家谱网络的持续曲线具有明显的结构。这种结构上的差异也延伸到系谱和社会网络的子网络,这表明,即使数据不完整,持久的同源性也可以用于有意义的系谱网络分析。在这里,我们还描述了如何使用持久同源性来表示来自系谱网络的概念,例如共同祖先循环。我们预计,随着人们对祖先研究的兴趣不断扩大,持久的同源工具将在家谱探索中变得越来越重要。
{"title":"The persistent homology of genealogical networks.","authors":"Zachary M Boyd,&nbsp;Nick Callor,&nbsp;Taylor Gledhill,&nbsp;Abigail Jenkins,&nbsp;Robert Snellman,&nbsp;Benjamin Webb,&nbsp;Raelynn Wonnacott","doi":"10.1007/s41109-023-00538-7","DOIUrl":"https://doi.org/10.1007/s41109-023-00538-7","url":null,"abstract":"<p><p>Genealogical networks (i.e. family trees) are of growing interest, with the largest known data sets now including well over one billion individuals. Interest in family history also supports an 8.5 billion dollar industry whose size is projected to double within 7 years [FutureWise report HC-1137]. Yet little mathematical attention has been paid to the complex network properties of genealogical networks, especially at large scales. The structure of genealogical networks is of particular interest due to the practice of forming unions, e.g. marriages, that are typically well outside one's immediate family. In most other networks, including other social networks, no equivalent restriction exists on the distance at which relationships form. To study the effect this has on genealogical networks we use persistent homology to identify and compare the structure of 101 genealogical and 31 other social networks. Specifically, we introduce the notion of a network's persistence curve, which encodes the network's set of persistence intervals. We find that the persistence curves of genealogical networks have a distinct structure when compared to other social networks. This difference in structure also extends to subnetworks of genealogical and social networks suggesting that, even with incomplete data, persistent homology can be used to meaningfully analyze genealogical networks. Here we also describe how concepts from genealogical networks, such as common ancestor cycles, are represented using persistent homology. We expect that persistent homology tools will become increasingly important in genealogical exploration as popular interest in ancestry research continues to expand.</p>","PeriodicalId":37010,"journal":{"name":"Applied Network Science","volume":"8 1","pages":"15"},"PeriodicalIF":2.2,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9950181/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9353129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fraud, corruption, and collusion in public procurement activities, a systematic literature review on data-driven methods 公共采购活动中的欺诈、腐败和勾结,关于数据驱动方法的系统文献综述
IF 2.2 Q3 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2022-12-15 DOI: 10.1007/s41109-022-00523-6
Marcos S. Lyra, B. Damásio, Flávio L. Pinheiro, F. Bação
{"title":"Fraud, corruption, and collusion in public procurement activities, a systematic literature review on data-driven methods","authors":"Marcos S. Lyra, B. Damásio, Flávio L. Pinheiro, F. Bação","doi":"10.1007/s41109-022-00523-6","DOIUrl":"https://doi.org/10.1007/s41109-022-00523-6","url":null,"abstract":"","PeriodicalId":37010,"journal":{"name":"Applied Network Science","volume":"7 1","pages":"1-30"},"PeriodicalIF":2.2,"publicationDate":"2022-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45918762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
Applied Network Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1