首页 > 最新文献

Biometrika最新文献

英文 中文
Sharp symbolic nonparametric bounds for measures of benefit in observational and imperfect randomized studies with ordinal outcomes 具有序数结果的观察性研究和不完全随机研究中收益测量的锐利符号非参数界限
IF 2.7 2区 数学 Q2 BIOLOGY Pub Date : 2024-04-11 DOI: 10.1093/biomet/asae020
Erin E Gabriel, Michael C Sachs, Andreas Kryger Jensen
Summary The probability of benefit is a valuable and meaningful measure of treatment effect, which has advantages over the average treatment effect. Particularly for an ordinal outcome, it has a better interpretation and can make apparent different aspects of the treatment impact. Unfortunately, this measure, and variations of it, are not identifiable even in randomized trials with perfect compliance. There is, for this reason, a long literature on nonparametric bounds for unidentifiable measures of benefit. These have primarily focused on perfect randomized trial settings and one or two specific estimands. We expand these bounds to observational settings with unmeasured confounders and imperfect randomized trials for all three estimands considered in the literature: the probability of benefit, the probability of no harm, and the relative treatment effect.
摘要 受益概率是衡量治疗效果的一个有价值、有意义的指标,它比平均治疗效果更有优势。特别是对于序数结果,它有更好的解释,并能使治疗效果的不同方面显而易见。遗憾的是,即使是在完全符合要求的随机试验中,也无法识别这种测量方法及其变体。因此,关于无法识别的收益测量的非参数界限的文献有很长的篇幅。这些文献主要集中于完美随机试验环境和一两个特定的估计值。我们将这些界限扩展到具有未测量混杂因素和不完全随机试验的观察环境中,适用于文献中考虑的所有三种估计值:获益概率、无害概率和相对治疗效果。
{"title":"Sharp symbolic nonparametric bounds for measures of benefit in observational and imperfect randomized studies with ordinal outcomes","authors":"Erin E Gabriel, Michael C Sachs, Andreas Kryger Jensen","doi":"10.1093/biomet/asae020","DOIUrl":"https://doi.org/10.1093/biomet/asae020","url":null,"abstract":"Summary The probability of benefit is a valuable and meaningful measure of treatment effect, which has advantages over the average treatment effect. Particularly for an ordinal outcome, it has a better interpretation and can make apparent different aspects of the treatment impact. Unfortunately, this measure, and variations of it, are not identifiable even in randomized trials with perfect compliance. There is, for this reason, a long literature on nonparametric bounds for unidentifiable measures of benefit. These have primarily focused on perfect randomized trial settings and one or two specific estimands. We expand these bounds to observational settings with unmeasured confounders and imperfect randomized trials for all three estimands considered in the literature: the probability of benefit, the probability of no harm, and the relative treatment effect.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"49 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140568397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Individualized dynamic model for multi-resolutional data 多分辨率数据的个性化动态模型
IF 2.7 2区 数学 Q2 BIOLOGY Pub Date : 2024-04-08 DOI: 10.1093/biomet/asae015
J Zhang, F Xue, Q Xu, J Lee, A Qu
SUMMARY Mobile health has emerged as a major success for tracking individual health status, due to the popularity and power of smartphones and wearable devices. This has also brought great challenges in handling heterogeneous, multi-resolution data which arise ubiquitously in mobile health due to irregular multivariate measurements collected from individuals. In this paper, we propose an individualized dynamic latent factor model for irregular multi-resolution time series data to interpolate unsampled measurements of time series with low resolution. One major advantage of the proposed method is the capability to integrate multiple irregular time series and multiple subjects by mapping the multi-resolution data to the latent space. In addition, the proposed individualized dynamic latent factor model is applicable to capturing heterogeneous longitudinal information through individualized dynamic latent factors. Our theory provides a bound on the integrated interpolation error and the convergence rate for B-spline approximation methods. Both the simulation studies and the application to smartwatch data demonstrate the superior performance of the proposed method compared to existing methods.
摘要 由于智能手机和可穿戴设备的普及和功能强大,移动医疗已成为跟踪个人健康状况的主要成功手段。这也给处理异构多分辨率数据带来了巨大挑战,由于从个人收集到的不规则多变量测量数据,这些数据在移动健康领域无处不在。在本文中,我们提出了一种针对不规则多分辨率时间序列数据的个性化动态潜因模型,用于插值低分辨率时间序列的未采样测量值。所提方法的一大优势是,通过将多分辨率数据映射到潜空间,能够整合多个不规则时间序列和多个研究对象。此外,所提出的个体化动态潜因子模型适用于通过个体化动态潜因子捕捉异质纵向信息。我们的理论提供了 B-样条近似方法的综合插值误差和收敛速率的约束。模拟研究和对智能手表数据的应用都表明,与现有方法相比,所提出的方法具有更优越的性能。
{"title":"Individualized dynamic model for multi-resolutional data","authors":"J Zhang, F Xue, Q Xu, J Lee, A Qu","doi":"10.1093/biomet/asae015","DOIUrl":"https://doi.org/10.1093/biomet/asae015","url":null,"abstract":"SUMMARY Mobile health has emerged as a major success for tracking individual health status, due to the popularity and power of smartphones and wearable devices. This has also brought great challenges in handling heterogeneous, multi-resolution data which arise ubiquitously in mobile health due to irregular multivariate measurements collected from individuals. In this paper, we propose an individualized dynamic latent factor model for irregular multi-resolution time series data to interpolate unsampled measurements of time series with low resolution. One major advantage of the proposed method is the capability to integrate multiple irregular time series and multiple subjects by mapping the multi-resolution data to the latent space. In addition, the proposed individualized dynamic latent factor model is applicable to capturing heterogeneous longitudinal information through individualized dynamic latent factors. Our theory provides a bound on the integrated interpolation error and the convergence rate for B-spline approximation methods. Both the simulation studies and the application to smartwatch data demonstrate the superior performance of the proposed method compared to existing methods.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"3 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140568582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Flexible control of the median of the false discovery proportion 灵活控制错误发现比例的中位数
IF 2.7 2区 数学 Q2 BIOLOGY Pub Date : 2024-03-23 DOI: 10.1093/biomet/asae018
Jesse Hemerik, Aldo Solari, Jelle J Goeman
We introduce a multiple testing procedure that controls the median of the proportion of false discoveries in a flexible way. The procedure only requires a vector of p-values as input and is comparable to the Benjamini–Hochberg method, which controls the mean of the proportion of false discoveries. Our method allows free choice of one or several values of alpha after seeing the data, unlike the Benjamini–Hochberg procedure, which can be very anti-conservative when alpha is chosen post hoc. We prove these claims and illustrate them with simulations. Our procedure is inspired by a popular estimator of the total number of true hypotheses. We adapt this estimator to provide simultaneously median unbiased estimators of the proportion of false discoveries, valid for finite samples. This simultaneity allows for the claimed flexibility. Our approach does not assume independence. The time complexity of our method is linear in the number of hypotheses, after sorting the p-values.
我们介绍了一种多重检验程序,它能以灵活的方式控制错误发现比例的中位数。该程序只需要一个 p 值向量作为输入,与控制错误发现比例均值的本杰明-霍奇伯格方法具有可比性。我们的方法允许在看到数据后自由选择一个或多个 alpha 值,这与 Benjamini-Hochberg 程序不同,后者在事后选择 alpha 值时可能非常不保守。我们证明了这些说法,并通过模拟进行了说明。我们的程序受到一个流行的真实假设总数估计器的启发。我们对这个估计器进行了调整,以同时提供对有限样本有效的错误发现比例的中值无偏估计器。这种同时性使我们获得了所宣称的灵活性。我们的方法不假定独立性。在对 p 值进行排序后,我们方法的时间复杂度与假设数量成线性关系。
{"title":"Flexible control of the median of the false discovery proportion","authors":"Jesse Hemerik, Aldo Solari, Jelle J Goeman","doi":"10.1093/biomet/asae018","DOIUrl":"https://doi.org/10.1093/biomet/asae018","url":null,"abstract":"We introduce a multiple testing procedure that controls the median of the proportion of false discoveries in a flexible way. The procedure only requires a vector of p-values as input and is comparable to the Benjamini–Hochberg method, which controls the mean of the proportion of false discoveries. Our method allows free choice of one or several values of alpha after seeing the data, unlike the Benjamini–Hochberg procedure, which can be very anti-conservative when alpha is chosen post hoc. We prove these claims and illustrate them with simulations. Our procedure is inspired by a popular estimator of the total number of true hypotheses. We adapt this estimator to provide simultaneously median unbiased estimators of the proportion of false discoveries, valid for finite samples. This simultaneity allows for the claimed flexibility. Our approach does not assume independence. The time complexity of our method is linear in the number of hypotheses, after sorting the p-values.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"309 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140199969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal regimes for algorithm-assisted human decision-making 算法辅助人类决策的最佳机制
IF 2.7 2区 数学 Q2 BIOLOGY Pub Date : 2024-03-19 DOI: 10.1093/biomet/asae016
M J Stensrud, J D Laurendeau, A L Sarvet
Summary We consider optimal regimes for algorithm-assisted human decision-making. Such regimes are decision functions of measured pre-treatment variables and, by leveraging natural treatment values, enjoy a superoptimality property whereby they are guaranteed to outperform conventional optimal regimes. When there is unmeasured confounding, the benefit of using superoptimal regimes can be considerable. When there is no unmeasured confounding, superoptimal regimes are identical to conventional optimal regimes. Furthermore, identification of the expected outcome under superoptimal regimes in non-experimental studies requires the same assumptions as identification of value functions under conventional optimal regimes when the treatment is binary. To illustrate the utility of superoptimal regimes, we derive identification and estimation results in a common instrumental variable setting. We use these derivations to analyse examples from the optimal regimes literature, including a case study of the effect of prompt intensive care treatment on survival.
摘要 我们考虑了算法辅助人类决策的最优机制。这种机制是测量的前处理变量的决策函数,通过利用自然处理值,它们具有超优特性,从而保证优于传统的最优机制。当存在无法测量的混杂因素时,使用超优化方案的好处可能相当可观。当不存在无法测量的混杂因素时,超最优制度与传统最优制度完全相同。此外,在非实验研究中,确定超最优制度下的预期结果所需的假设条件,与治疗为二元时确定传统最优制度下的价值函数所需的假设条件相同。为了说明超最优制度的实用性,我们推导了普通工具变量设置下的识别和估计结果。我们利用这些推导来分析最优制度文献中的例子,包括一个关于及时重症监护治疗对存活率影响的案例研究。
{"title":"Optimal regimes for algorithm-assisted human decision-making","authors":"M J Stensrud, J D Laurendeau, A L Sarvet","doi":"10.1093/biomet/asae016","DOIUrl":"https://doi.org/10.1093/biomet/asae016","url":null,"abstract":"Summary We consider optimal regimes for algorithm-assisted human decision-making. Such regimes are decision functions of measured pre-treatment variables and, by leveraging natural treatment values, enjoy a superoptimality property whereby they are guaranteed to outperform conventional optimal regimes. When there is unmeasured confounding, the benefit of using superoptimal regimes can be considerable. When there is no unmeasured confounding, superoptimal regimes are identical to conventional optimal regimes. Furthermore, identification of the expected outcome under superoptimal regimes in non-experimental studies requires the same assumptions as identification of value functions under conventional optimal regimes when the treatment is binary. To illustrate the utility of superoptimal regimes, we derive identification and estimation results in a common instrumental variable setting. We use these derivations to analyse examples from the optimal regimes literature, including a case study of the effect of prompt intensive care treatment on survival.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"309 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140199831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inference of partial correlations of a multivariate Gaussian time series 多变量高斯时间序列的局部相关性推理
IF 2.7 2区 数学 Q2 BIOLOGY Pub Date : 2024-02-26 DOI: 10.1093/biomet/asae012
A S DiLernia, M Fiecas, L Zhang
We derive an asymptotic joint distribution and novel covariance estimator for the partial correlations of a multivariate Gaussian time series given mild regularity conditions. Using our derived asymptotic distribution, we develop a Wald confidence interval and testing procedure for inference of individual partial correlations for time series data. Through simulation we demonstrate that our proposed confidence interval attains higher coverage rates, and our testing procedure attains false positive rates closer to the nominal levels than approaches that assume independent observations when autocorrelation is present.
在轻度正则性条件下,我们推导出了多元高斯时间序列偏相关性的渐近联合分布和新型协方差估计器。利用我们推导出的渐近分布,我们开发了一种 Wald 置信区间和测试程序,用于推断时间序列数据的单个偏相关性。通过仿真,我们证明了我们提出的置信区间能获得更高的覆盖率,而我们的测试程序能获得更接近名义水平的假阳性率。
{"title":"Inference of partial correlations of a multivariate Gaussian time series","authors":"A S DiLernia, M Fiecas, L Zhang","doi":"10.1093/biomet/asae012","DOIUrl":"https://doi.org/10.1093/biomet/asae012","url":null,"abstract":"We derive an asymptotic joint distribution and novel covariance estimator for the partial correlations of a multivariate Gaussian time series given mild regularity conditions. Using our derived asymptotic distribution, we develop a Wald confidence interval and testing procedure for inference of individual partial correlations for time series data. Through simulation we demonstrate that our proposed confidence interval attains higher coverage rates, and our testing procedure attains false positive rates closer to the nominal levels than approaches that assume independent observations when autocorrelation is present.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"101 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139977684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Network-adjusted covariates for community detection 用于群落探测的网络调整协变量
IF 2.7 2区 数学 Q2 BIOLOGY Pub Date : 2024-02-24 DOI: 10.1093/biomet/asae011
Y Hu, W Wang
Community detection is a crucial task in network analysis that can be significantly improved by incorporating subject-level information, i.e., covariates. Existing methods have shown the effectiveness of using covariates on the low-degree nodes, but rarely discuss the case where communities have significantly different density levels, i.e. multiscale networks. In this paper, we introduce a novel method that addresses this challenge by constructing network-adjusted covariates, which leverage the network connections and covariates with a node-specific weight for each node. This weight can be calculated without tuning parameters. We present novel theoretical results on the strong consistency of our method under degree-corrected stochastic blockmodels with covariates, even in the presence of misspecification and multiple sparse communities. Additionally, we establish a general lower bound for the community detection problem when both network and covariates are present, and it shows our method is optimal for connection intensity up to a constant factor. Our method outperforms existing approaches in simulations and a LastFM app user network. We then compare our method with others on a statistics publication citation network where 30% of nodes are isolated, and our method produces reasonable and balanced results.
社群检测是网络分析中的一项重要任务,通过纳入主体层面的信息(即协变量),可以大大提高检测效率。现有的方法已经证明了在低度节点上使用协变量的有效性,但很少讨论社群密度水平存在显著差异的情况,即多尺度网络。在本文中,我们引入了一种新方法,通过构建网络调整协变量来应对这一挑战,该方法利用了网络连接和协变量,并为每个节点设定了特定于节点的权重。该权重无需调整参数即可计算。我们提出了新的理论结果,说明在带有协变量的度校正随机块模型下,我们的方法具有很强的一致性,即使在存在规范错误和多个稀疏群落的情况下也是如此。此外,我们还为同时存在网络和协变量时的群落检测问题建立了一般下界,并表明我们的方法在连接强度达到一个常数因子时是最优的。在模拟和 LastFM 应用程序用户网络中,我们的方法优于现有方法。然后,我们在统计出版物引用网络中将我们的方法与其他方法进行了比较,在该网络中,有 30% 的节点是孤立的,我们的方法产生了合理而均衡的结果。
{"title":"Network-adjusted covariates for community detection","authors":"Y Hu, W Wang","doi":"10.1093/biomet/asae011","DOIUrl":"https://doi.org/10.1093/biomet/asae011","url":null,"abstract":"Community detection is a crucial task in network analysis that can be significantly improved by incorporating subject-level information, i.e., covariates. Existing methods have shown the effectiveness of using covariates on the low-degree nodes, but rarely discuss the case where communities have significantly different density levels, i.e. multiscale networks. In this paper, we introduce a novel method that addresses this challenge by constructing network-adjusted covariates, which leverage the network connections and covariates with a node-specific weight for each node. This weight can be calculated without tuning parameters. We present novel theoretical results on the strong consistency of our method under degree-corrected stochastic blockmodels with covariates, even in the presence of misspecification and multiple sparse communities. Additionally, we establish a general lower bound for the community detection problem when both network and covariates are present, and it shows our method is optimal for connection intensity up to a constant factor. Our method outperforms existing approaches in simulations and a LastFM app user network. We then compare our method with others on a statistics publication citation network where 30% of nodes are isolated, and our method produces reasonable and balanced results.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"35 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139950512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Selective conformal inference with false coverage-statement rate control 带错误覆盖率控制的选择性保形推理
IF 2.7 2区 数学 Q2 BIOLOGY Pub Date : 2024-02-19 DOI: 10.1093/biomet/asae010
Yajie Bao, Yuyang Huo, Haojie Ren, Changliang Zou
Conformal inference is a popular tool for constructing prediction intervals. We consider here the scenario of post-selection/selective conformal inference, that is prediction intervals are reported only for individuals selected from unlabelled test data. To account for multiplicity, we develop a general split conformal framework to construct selective prediction intervals with the false coverage-statement rate control. We first investigate benjamini2005false's false coverage rate-adjusted method in the present setting, and show that it is able to achieve false coverage-statement rate control but yields uniformly inflated prediction intervals. We then propose a novel solution to the problem called selective conditional conformal prediction. Our method performs selection procedures on both the calibration set and test set, and then constructs conformal prediction intervals for the selected test candidates with the aid of conditional empirical distribution obtained by the post-selection calibration set. When the selection rule is exchangeable, we show that our proposed method can exactly control the false coverage-statement rate in a model-free and distribution-free guarantee. For non-exchangeable selection procedures involving the calibration set, we provide non-asymptotic bounds for the false coverage-statement rate under mild distributional assumptions. Numerical results confirm the effectiveness and robustness of our method in false coverage-statement rate control and show that it achieves more narrowed prediction intervals over existing methods across various settings.
共形推理是构建预测区间的常用工具。我们在此考虑的是后选择/选择性共形推理的情况,即预测区间仅针对从无标签测试数据中选出的个体进行报告。为了考虑多重性,我们开发了一个通用的分裂保形框架,以构建具有错误覆盖率控制的选择性预测区间。我们首先研究了 benjamini2005false 在当前环境下的虚假覆盖率调整方法,结果表明该方法能够实现虚假覆盖率控制,但会产生均匀膨胀的预测区间。然后,我们提出了一种新的解决方案,称为选择性条件保形预测。我们的方法同时对校准集和测试集执行选择程序,然后借助选择后校准集获得的条件经验分布为选定的候选测试构建保形预测区间。当选择规则是可交换的时,我们证明我们提出的方法可以在无模型和无分布的保证下精确控制错误覆盖率。对于涉及校准集的不可交换选择程序,我们在温和的分布假设条件下提供了虚假覆盖率的非渐近界限。数值结果证实了我们的方法在控制虚假覆盖率方面的有效性和稳健性,并表明在各种情况下,我们的方法比现有方法实现了更窄的预测区间。
{"title":"Selective conformal inference with false coverage-statement rate control","authors":"Yajie Bao, Yuyang Huo, Haojie Ren, Changliang Zou","doi":"10.1093/biomet/asae010","DOIUrl":"https://doi.org/10.1093/biomet/asae010","url":null,"abstract":"Conformal inference is a popular tool for constructing prediction intervals. We consider here the scenario of post-selection/selective conformal inference, that is prediction intervals are reported only for individuals selected from unlabelled test data. To account for multiplicity, we develop a general split conformal framework to construct selective prediction intervals with the false coverage-statement rate control. We first investigate benjamini2005false's false coverage rate-adjusted method in the present setting, and show that it is able to achieve false coverage-statement rate control but yields uniformly inflated prediction intervals. We then propose a novel solution to the problem called selective conditional conformal prediction. Our method performs selection procedures on both the calibration set and test set, and then constructs conformal prediction intervals for the selected test candidates with the aid of conditional empirical distribution obtained by the post-selection calibration set. When the selection rule is exchangeable, we show that our proposed method can exactly control the false coverage-statement rate in a model-free and distribution-free guarantee. For non-exchangeable selection procedures involving the calibration set, we provide non-asymptotic bounds for the false coverage-statement rate under mild distributional assumptions. Numerical results confirm the effectiveness and robustness of our method in false coverage-statement rate control and show that it achieves more narrowed prediction intervals over existing methods across various settings.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"21 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139950365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Promises of Parallel Outcomes 并行成果的承诺
IF 2.7 2区 数学 Q2 BIOLOGY Pub Date : 2024-02-17 DOI: 10.1093/biomet/asae008
Ying Zhou, Dingke Tang, Dehan Kong, Linbo Wang
A key challenge in causal inference from observational studies is the identification and estimation of causal effects in the presence of unmeasured confounding. In this paper, we introduce a novel approach for causal inference that leverages information in multiple outcomes to deal with unmeasured confounding. An important assumption in our approach is conditional independence among multiple outcomes. In contrast to existing proposals in the literature, the roles of multiple outcomes in the conditional independence assumption are symmetric, hence the name parallel outcomes. We show nonparametric identifiability with at least three parallel outcomes and provide parametric estimation tools under a set of linear structural equation models. Our proposal is evaluated through a set of synthetic and real data analyses.
从观察性研究中进行因果推断的一个主要挑战是在存在未测量混杂因素的情况下识别和估计因果效应。在本文中,我们介绍了一种新的因果推断方法,该方法利用多个结果的信息来处理无法测量的混杂因素。我们方法的一个重要假设是多个结果之间的条件独立性。与现有文献中的建议不同,条件独立性假设中多个结果的角色是对称的,因此被称为平行结果。我们展示了至少三个平行结果的非参数可识别性,并提供了一组线性结构方程模型下的参数估计工具。我们通过一组合成数据和真实数据分析对我们的建议进行了评估。
{"title":"The Promises of Parallel Outcomes","authors":"Ying Zhou, Dingke Tang, Dehan Kong, Linbo Wang","doi":"10.1093/biomet/asae008","DOIUrl":"https://doi.org/10.1093/biomet/asae008","url":null,"abstract":"A key challenge in causal inference from observational studies is the identification and estimation of causal effects in the presence of unmeasured confounding. In this paper, we introduce a novel approach for causal inference that leverages information in multiple outcomes to deal with unmeasured confounding. An important assumption in our approach is conditional independence among multiple outcomes. In contrast to existing proposals in the literature, the roles of multiple outcomes in the conditional independence assumption are symmetric, hence the name parallel outcomes. We show nonparametric identifiability with at least three parallel outcomes and provide parametric estimation tools under a set of linear structural equation models. Our proposal is evaluated through a set of synthetic and real data analyses.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"191 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139924069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Doubly robust estimation under covariate-induced dependent left truncation 协变量诱发的依存左截断条件下的双稳健估计
IF 2.7 2区 数学 Q2 BIOLOGY Pub Date : 2024-02-11 DOI: 10.1093/biomet/asae005
Yuyao Wang, Andrew Ying, Ronghui Xu
Summary In prevalent cohort studies with follow-up, the time-to-event outcome is subject to left truncation leading to selection bias. For estimation of the distribution of time-to-event, conventional methods adjusting for left truncation tend to rely on the quasi-independence assumption that the truncation time and the event time are independent on the observed region. This assumption is violated when there is dependence between the truncation time and the event time possibly induced by measured covariates. Inverse probability of truncation weighting can be used in this case, but it is sensitive to misspecification of the truncation model. In this work, we apply semiparametric theory to find the efficient influence curve of the expectation of an arbitrarily transformed survival time in the presence of covariate-induced dependent left truncation. We then use it to construct estimators that are shown to enjoy double-robustness properties. Our work represents the first attempt to construct doubly robust estimators in the presence of left truncation, which does not fall under the established framework of coarsened data where doubly robust approaches were developed. We provide technical conditions for the asymptotic properties that appear to not have been carefully examined in the literature for time-to-event data, and study the estimators via extensive simulation. We apply the estimators to two datasets from practice, with different right-censoring patterns.
摘要 在有随访的流行队列研究中,从时间到事件的结果会出现左截断,从而导致选择偏差。对于事件发生时间分布的估计,调整左截断的传统方法往往依赖于准独立性假设,即截断时间和事件发生时间在观察区域内是独立的。当截断时间和事件时间之间存在可能由测量协变量引起的依赖关系时,这一假设就被打破了。在这种情况下,可以使用截断加权的反概率,但它对截断模型的错误规范很敏感。在这项研究中,我们运用半参数理论,找到了在存在协变量诱导的左截断情况下,任意转换的生存时间期望的有效影响曲线。然后,我们用它来构建估计器,并证明这些估计器具有双重稳健性。我们的工作代表了在左截断情况下构建双重稳健估计器的首次尝试,而左截断并不属于已开发出双重稳健方法的粗化数据既定框架。我们提供了渐近特性的技术条件,这些条件在时间到事件数据的文献中似乎还没有仔细研究过,我们通过大量的模拟来研究这些估计器。我们将估计器应用于两个具有不同右删减模式的实践数据集。
{"title":"Doubly robust estimation under covariate-induced dependent left truncation","authors":"Yuyao Wang, Andrew Ying, Ronghui Xu","doi":"10.1093/biomet/asae005","DOIUrl":"https://doi.org/10.1093/biomet/asae005","url":null,"abstract":"Summary In prevalent cohort studies with follow-up, the time-to-event outcome is subject to left truncation leading to selection bias. For estimation of the distribution of time-to-event, conventional methods adjusting for left truncation tend to rely on the quasi-independence assumption that the truncation time and the event time are independent on the observed region. This assumption is violated when there is dependence between the truncation time and the event time possibly induced by measured covariates. Inverse probability of truncation weighting can be used in this case, but it is sensitive to misspecification of the truncation model. In this work, we apply semiparametric theory to find the efficient influence curve of the expectation of an arbitrarily transformed survival time in the presence of covariate-induced dependent left truncation. We then use it to construct estimators that are shown to enjoy double-robustness properties. Our work represents the first attempt to construct doubly robust estimators in the presence of left truncation, which does not fall under the established framework of coarsened data where doubly robust approaches were developed. We provide technical conditions for the asymptotic properties that appear to not have been carefully examined in the literature for time-to-event data, and study the estimators via extensive simulation. We apply the estimators to two datasets from practice, with different right-censoring patterns.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"45 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139769979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Regression analysis of group-tested current status data 对分组测试的现状数据进行回归分析
IF 2.7 2区 数学 Q2 BIOLOGY Pub Date : 2024-02-08 DOI: 10.1093/biomet/asae006
Shuwei Li, Tao Hu, Lianming Wang, Christopher S McMahan, Joshua M Tebbs
Summary Group testing is an effective way to reduce the time and cost associated with conducting large-scale screening for infectious diseases. Benefits are realized through testing pools formed by combining specimens, such as blood or urine, from different individuals. In some studies, individuals are assessed only once and a time-to-event endpoint is recorded, for example, the time until infection. Combining group testing with this type of endpoint results in group-tested current status data (?). To analyse these complex data, we propose methods which estimate a proportional hazards regression model based on test outcomes from measuring the pools. A sieve maximum likelihood estimation approach is developed that approximates the cumulative baseline hazard function with a piecewise constant function. To identify the sieve estimator, a computationally efficient expectation-maximization algorithm is derived by using data augmentation. Asymptotic properties of both the parametric and nonparametric components of the sieve estimator are then established by applying modern empirical process theory. Numerical results from simulation studies show that our proposed method performs nominally and has advantages over the corresponding estimation method based on individual testing results. We illustrate our work by analysing a chlamydia dataset collected by the State Hygienic Laboratory at the University of Iowa.
摘要 群体检测是减少大规模传染病筛查所需的时间和成本的有效方法。通过将不同个体的血液或尿液等标本组合成检测池,可实现效益。在某些研究中,只对个体进行一次评估,并记录时间终点,如感染前的时间。将分组检测与这类终点相结合,就会得到分组检测的当前状态数据(?)为了分析这些复杂的数据,我们提出了一些方法,这些方法基于测量池的测试结果来估计比例危险回归模型。我们开发了一种筛网最大似然估计方法,用一个片断常数函数来近似累积基线危害函数。为了确定筛网估计器,利用数据扩增推导出了一种计算高效的期望最大化算法。然后,通过应用现代经验过程理论,建立了筛网估计器的参数和非参数部分的渐近特性。模拟研究的数值结果表明,与基于单个测试结果的相应估算方法相比,我们提出的方法具有显著的性能和优势。我们通过分析爱荷华大学国家卫生实验室收集的衣原体数据集来说明我们的工作。
{"title":"Regression analysis of group-tested current status data","authors":"Shuwei Li, Tao Hu, Lianming Wang, Christopher S McMahan, Joshua M Tebbs","doi":"10.1093/biomet/asae006","DOIUrl":"https://doi.org/10.1093/biomet/asae006","url":null,"abstract":"Summary Group testing is an effective way to reduce the time and cost associated with conducting large-scale screening for infectious diseases. Benefits are realized through testing pools formed by combining specimens, such as blood or urine, from different individuals. In some studies, individuals are assessed only once and a time-to-event endpoint is recorded, for example, the time until infection. Combining group testing with this type of endpoint results in group-tested current status data (?). To analyse these complex data, we propose methods which estimate a proportional hazards regression model based on test outcomes from measuring the pools. A sieve maximum likelihood estimation approach is developed that approximates the cumulative baseline hazard function with a piecewise constant function. To identify the sieve estimator, a computationally efficient expectation-maximization algorithm is derived by using data augmentation. Asymptotic properties of both the parametric and nonparametric components of the sieve estimator are then established by applying modern empirical process theory. Numerical results from simulation studies show that our proposed method performs nominally and has advantages over the corresponding estimation method based on individual testing results. We illustrate our work by analysing a chlamydia dataset collected by the State Hygienic Laboratory at the University of Iowa.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"25 1","pages":""},"PeriodicalIF":2.7,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139770356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Biometrika
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1