Plasmode simulation for the evaluation of causal inference methods in homophilous social networks

Vanessa McNealis, Erica E. M. Moodie, Nema Dean
{"title":"Plasmode simulation for the evaluation of causal inference methods in homophilous social networks","authors":"Vanessa McNealis, Erica E. M. Moodie, Nema Dean","doi":"arxiv-2409.01316","DOIUrl":null,"url":null,"abstract":"Typical simulation approaches for evaluating the performance of statistical\nmethods on populations embedded in social networks may fail to capture\nimportant features of real-world networks. It can therefore be unclear whether\ninference methods for causal effects due to interference that have been shown\nto perform well in such synthetic networks are applicable to social networks\nwhich arise in the real world. Plasmode simulation studies use a real dataset\ncreated from natural processes, but with part of the data-generation mechanism\nknown. However, given the sensitivity of relational data, many network data are\nprotected from unauthorized access or disclosure. In such case, plasmode\nsimulations cannot use released versions of real datasets which often omit the\nnetwork links, and instead can only rely on parameters estimated from them. A\nstatistical framework for creating replicated simulation datasets from private\nsocial network data is developed and validated. The approach consists of\nsimulating from a parametric exponential family random graph model fitted to\nthe network data and resampling from the observed exposure and covariate\ndistributions to preserve the associations among these variables.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Computation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.01316","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Typical simulation approaches for evaluating the performance of statistical methods on populations embedded in social networks may fail to capture important features of real-world networks. It can therefore be unclear whether inference methods for causal effects due to interference that have been shown to perform well in such synthetic networks are applicable to social networks which arise in the real world. Plasmode simulation studies use a real dataset created from natural processes, but with part of the data-generation mechanism known. However, given the sensitivity of relational data, many network data are protected from unauthorized access or disclosure. In such case, plasmode simulations cannot use released versions of real datasets which often omit the network links, and instead can only rely on parameters estimated from them. A statistical framework for creating replicated simulation datasets from private social network data is developed and validated. The approach consists of simulating from a parametric exponential family random graph model fitted to the network data and resampling from the observed exposure and covariate distributions to preserve the associations among these variables.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
质点模拟用于评估同亲社会网络中的因果推理方法
典型的模拟方法用于评估统计方法在嵌入社交网络的人群中的表现,可能无法捕捉真实世界网络的重要特征。因此,在此类合成网络中表现出色的干扰因果效应推断方法是否适用于真实世界中出现的社会网络,可能并不清楚。质点模拟研究使用的是由自然过程创建的真实数据集,但数据生成机制的一部分是已知的。然而,鉴于关系数据的敏感性,许多网络数据都受到保护,以防止未经授权的访问或泄露。在这种情况下,等离子体模拟无法使用真实数据集的发布版本,因为这些数据集往往省略了网络链接,而只能依靠从中估算出的参数。本文开发并验证了从私人社交网络数据创建复制模拟数据集的统计框架。该方法包括从一个拟合网络数据的参数指数族随机图模型中进行模拟,并从观察到的暴露和协变分布中重新采样,以保留这些变量之间的关联。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Statistical Finite Elements via Interacting Particle Langevin Dynamics Graph sub-sampling for divide-and-conquer algorithms in large networks Optimizing VarLiNGAM for Scalable and Efficient Time Series Causal Discovery Best Linear Unbiased Estimate from Privatized Histograms A Bayesian Optimization through Sequential Monte Carlo and Statistical Physics-Inspired Techniques
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1