网络模型中样本大小分析的一般蒙特卡罗方法。

IF 7.8 1区心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Psychological methods Pub Date : 2023-07-10 DOI:10.1037/met0000555

Mihai A Constantin, Noémi K Schuurman, Jeroen K Vermunt

{"title":"网络模型中样本大小分析的一般蒙特卡罗方法。","authors":"Mihai A Constantin, Noémi K Schuurman, Jeroen K Vermunt","doi":"10.1037/met0000555","DOIUrl":null,"url":null,"abstract":"We introduce a general method for sample size computations in the context of cross-sectional network models. The method takes the form of an automated Monte Carlo algorithm, designed to find an optimal sample size while iteratively concentrating the computations on the sample sizes that seem most relevant. The method requires three inputs: (1) a hypothesized network structure or desired characteristics of that structure, (2) an estimation performance measure and its corresponding target value (e.g., a sensitivity of 0.6), and (3) a statistic and its corresponding target value that determines how the target value for the performance measure be reached (e.g., reaching a sensitivity of 0.6 with a probability of 0.8). The method consists of a Monte Carlo simulation step for computing the performance measure and the statistic for several sample sizes selected from an initial candidate sample size range, a curve-fitting step for interpolating the statistic across the entire candidate range, and a stratified bootstrapping step to quantify the uncertainty around the recommendation provided. We evaluated the performance of the method for the Gaussian Graphical Model, but it can easily extend to other models. The method displayed good performance, providing sample size recommendations that were, on average, within three observations of a benchmark sample size, with the highest standard deviation of 25.87 observations. The method discussed is implemented in the form of an R package called powerly, available on GitHub and CRAN. (PsycInfo Database Record (c) 2023 APA, all rights reserved).","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.8000,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A general Monte Carlo method for sample size analysis in the context of network models.\",\"authors\":\"Mihai A Constantin, Noémi K Schuurman, Jeroen K Vermunt\",\"doi\":\"10.1037/met0000555\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We introduce a general method for sample size computations in the context of cross-sectional network models. The method takes the form of an automated Monte Carlo algorithm, designed to find an optimal sample size while iteratively concentrating the computations on the sample sizes that seem most relevant. The method requires three inputs: (1) a hypothesized network structure or desired characteristics of that structure, (2) an estimation performance measure and its corresponding target value (e.g., a sensitivity of 0.6), and (3) a statistic and its corresponding target value that determines how the target value for the performance measure be reached (e.g., reaching a sensitivity of 0.6 with a probability of 0.8). The method consists of a Monte Carlo simulation step for computing the performance measure and the statistic for several sample sizes selected from an initial candidate sample size range, a curve-fitting step for interpolating the statistic across the entire candidate range, and a stratified bootstrapping step to quantify the uncertainty around the recommendation provided. We evaluated the performance of the method for the Gaussian Graphical Model, but it can easily extend to other models. The method displayed good performance, providing sample size recommendations that were, on average, within three observations of a benchmark sample size, with the highest standard deviation of 25.87 observations. The method discussed is implemented in the form of an R package called powerly, available on GitHub and CRAN. (PsycInfo Database Record (c) 2023 APA, all rights reserved).\",\"PeriodicalId\":20782,\"journal\":{\"name\":\"Psychological methods\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":7.8000,\"publicationDate\":\"2023-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Psychological methods\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://doi.org/10.1037/met0000555\",\"RegionNum\":1,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PSYCHOLOGY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Psychological methods","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1037/met0000555","RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

摘要

我们介绍了在横截面网络模型中计算样本大小的一般方法。该方法采用自动蒙特卡罗算法的形式，旨在找到最佳样本量，同时迭代地将计算集中在似乎最相关的样本量上。该方法需要三个输入:(1)假设的网络结构或该结构的期望特征，(2)估计性能度量及其相应的目标值(例如，灵敏度为0.6)，以及(3)决定如何达到性能度量的目标值的统计量及其相应的目标值(例如，以0.8的概率达到灵敏度为0.6)。该方法包括一个蒙特卡罗模拟步骤，用于计算从初始候选样本量范围中选择的几个样本量的性能度量和统计量，一个曲线拟合步骤，用于在整个候选样本量范围内插值统计量，以及一个分层自举步骤，用于量化所提供推荐的不确定性。我们评估了该方法在高斯图形模型上的性能，但它很容易扩展到其他模型。该方法表现出良好的性能，提供的样本量建议平均在基准样本量的三个观测值范围内，最高标准偏差为25.87观测值。所讨论的方法以一个名为powery的R包的形式实现，可以在GitHub和CRAN上获得。(PsycInfo数据库记录(c) 2023 APA，版权所有)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A general Monte Carlo method for sample size analysis in the context of network models.

We introduce a general method for sample size computations in the context of cross-sectional network models. The method takes the form of an automated Monte Carlo algorithm, designed to find an optimal sample size while iteratively concentrating the computations on the sample sizes that seem most relevant. The method requires three inputs: (1) a hypothesized network structure or desired characteristics of that structure, (2) an estimation performance measure and its corresponding target value (e.g., a sensitivity of 0.6), and (3) a statistic and its corresponding target value that determines how the target value for the performance measure be reached (e.g., reaching a sensitivity of 0.6 with a probability of 0.8). The method consists of a Monte Carlo simulation step for computing the performance measure and the statistic for several sample sizes selected from an initial candidate sample size range, a curve-fitting step for interpolating the statistic across the entire candidate range, and a stratified bootstrapping step to quantify the uncertainty around the recommendation provided. We evaluated the performance of the method for the Gaussian Graphical Model, but it can easily extend to other models. The method displayed good performance, providing sample size recommendations that were, on average, within three observations of a benchmark sample size, with the highest standard deviation of 25.87 observations. The method discussed is implemented in the form of an R package called powerly, available on GitHub and CRAN. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Psychological methods PSYCHOLOGY, MULTIDISCIPLINARY-

CiteScore

13.10

自引率

7.10%

发文量

159

期刊介绍： Psychological Methods is devoted to the development and dissemination of methods for collecting, analyzing, understanding, and interpreting psychological data. Its purpose is the dissemination of innovations in research design, measurement, methodology, and quantitative and qualitative analysis to the psychological community; its further purpose is to promote effective communication about related substantive and methodological issues. The audience is expected to be diverse and to include those who develop new procedures, those who are responsible for undergraduate and graduate training in design, measurement, and statistics, as well as those who employ those procedures in research.