首页 > 最新文献

Journal of Statistical Planning and Inference最新文献

英文 中文
Statistical inference for wavelet curve estimators of symmetric positive definite matrices 对称正定矩阵小波曲线估计器的统计推理
IF 0.9 4区 数学 Q2 Mathematics Pub Date : 2024-01-09 DOI: 10.1016/j.jspi.2023.106140
Daniel Rademacher , Johannes Krebs , Rainer von Sachs

In this paper we treat statistical inference for a wavelet estimator of curves of symmetric positive definite (SPD) using the log-Euclidean distance. This estimator preserves positive-definiteness and enjoys permutation-equivariance, which is particularly relevant for covariance matrices. Our second-generation wavelet estimator is based on average-interpolation (AI) and allows the same powerful properties, including fast algorithms, known from nonparametric curve estimation with wavelets in standard Euclidean set-ups. The core of our work is the proposition of confidence sets for our AI wavelet estimator in a non-Euclidean geometry. We derive asymptotic normality of this estimator, including explicit expressions of its asymptotic variance. This opens the door for constructing asymptotic confidence regions which we compare with our proposed bootstrap scheme for inference. Detailed numerical simulations confirm the appropriateness of our suggested inference schemes.

本文利用对数欧氏距离对对称正定(SPD)曲线的小波估计器进行统计推断。该估计器保留了正定性并具有包换方差性,这与协方差矩阵尤其相关。我们的第二代小波估计器基于平均插值(AI),具有与标准欧几里得设置中的小波非参数曲线估计器相同的强大特性,包括快速算法。我们工作的核心是为非欧几里得几何中的 AI 小波估计器提出置信集。我们推导出该估计器的渐近正态性,包括其渐近方差的明确表达式。这为我们构建渐近置信区域打开了大门,我们将这些置信区域与我们提出的引导推理方案进行比较。详细的数值模拟证实了我们建议的推理方案的适当性。
{"title":"Statistical inference for wavelet curve estimators of symmetric positive definite matrices","authors":"Daniel Rademacher ,&nbsp;Johannes Krebs ,&nbsp;Rainer von Sachs","doi":"10.1016/j.jspi.2023.106140","DOIUrl":"https://doi.org/10.1016/j.jspi.2023.106140","url":null,"abstract":"<div><p><span>In this paper we treat statistical inference<span> for a wavelet estimator of curves of symmetric positive definite (SPD) using the log-Euclidean distance. This estimator preserves positive-definiteness and enjoys permutation-equivariance, which is particularly relevant for covariance matrices. Our second-generation wavelet estimator is based on average-interpolation (AI) and allows the same powerful properties, including fast algorithms, known from nonparametric curve estimation with wavelets in standard Euclidean set-ups. The core of our work is the </span></span>proposition<span> of confidence sets for our AI wavelet estimator in a non-Euclidean geometry. We derive asymptotic normality<span> of this estimator, including explicit expressions of its asymptotic variance<span>. This opens the door for constructing asymptotic confidence regions which we compare with our proposed bootstrap scheme for inference. Detailed numerical simulations confirm the appropriateness of our suggested inference schemes.</span></span></span></p></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139433656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uniformly more powerful tests for a subset of the components of a Normal Mean Vector 对正态均值向量的子集成分进行统一的更强大测试
IF 0.9 4区 数学 Q2 Mathematics Pub Date : 2023-12-27 DOI: 10.1016/j.jspi.2023.106141
Yining Wang , Gang Li

A class of tests that are uniformly more powerful than the likelihood ratio test is derived for testing the hypothesis about the means of a subset of the components of a multivariate normal distribution with unknown covariance matrix, when the means of the other subset of the components are known.

推导出一类比似然比检验更有效的检验方法,用于检验具有未知协方差矩阵的多元正态分布中一个子集分量的均值假设,而另一个子集分量的均值是已知的。
{"title":"Uniformly more powerful tests for a subset of the components of a Normal Mean Vector","authors":"Yining Wang ,&nbsp;Gang Li","doi":"10.1016/j.jspi.2023.106141","DOIUrl":"10.1016/j.jspi.2023.106141","url":null,"abstract":"<div><p>A class of tests that are uniformly more powerful than the likelihood ratio test<span> is derived for testing the hypothesis about the means of a subset of the components of a multivariate normal distribution<span> with unknown covariance matrix, when the means of the other subset of the components are known.</span></span></p></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2023-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139070464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sparse multiple kernel learning: Minimax rates with random projection 稀疏多核学习:随机投影的最小率
IF 0.9 4区 数学 Q2 Mathematics Pub Date : 2023-12-27 DOI: 10.1016/j.jspi.2023.106142
Wenqi Lu , Zhongyi Zhu , Rui Li , Heng Lian

In kernel-based learning, the random projection method, also called random sketching, has been successfully used in kernel ridge regression to reduce the computational burden in the big data setting, and at the same time retain the minimax convergence rate. In this work, we consider its use in sparse multiple kernel learning problems where a closed-form optimizer is not available, which poses significant technical challenges, for which the existing results do not carry over directly. Even when random projection is not used, our risk bound improves on the existing results in several aspects. We also illustrate the use of random projection via some numerical examples.

在基于内核的学习中,随机投影法(又称随机草图法)已成功应用于内核脊回归,以减轻大数据环境下的计算负担,同时保留最小收敛率。在这项工作中,我们考虑将其用于稀疏多核学习问题中,因为在这些问题中没有闭式优化器,这带来了巨大的技术挑战,而现有的结果并不能直接用于这些问题。即使不使用随机投影,我们的风险边界也在多个方面改进了现有结果。我们还通过一些数值示例说明了随机投影的使用。
{"title":"Sparse multiple kernel learning: Minimax rates with random projection","authors":"Wenqi Lu ,&nbsp;Zhongyi Zhu ,&nbsp;Rui Li ,&nbsp;Heng Lian","doi":"10.1016/j.jspi.2023.106142","DOIUrl":"10.1016/j.jspi.2023.106142","url":null,"abstract":"<div><p>In kernel-based learning, the random projection method, also called random sketching, has been successfully used in kernel ridge regression to reduce the computational burden in the big data setting, and at the same time retain the minimax convergence rate. In this work, we consider its use in sparse multiple kernel learning problems where a closed-form optimizer is not available, which poses significant technical challenges, for which the existing results do not carry over directly. Even when random projection is not used, our risk bound improves on the existing results in several aspects. We also illustrate the use of random projection via some numerical examples.</p></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2023-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139070664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A new First-Order mixture integer-valued threshold autoregressive process based on binomial thinning and negative binomial thinning 基于二项稀疏化和负二项稀疏化的新一阶混合整数值阈值自回归过程
IF 0.9 4区 数学 Q2 Mathematics Pub Date : 2023-12-26 DOI: 10.1016/j.jspi.2023.106143
Danshu Sheng , Dehui Wang , Liuquan Sun

In this paper, we introduce a new first-order mixture integer-valued threshold autoregressive process, based on the binomial and negative binomial thinning operators. Basic probabilistic and statistical properties of this model are discussed. Conditional least squares (CLS) and conditional maximum likelihood (CML) estimators are derived and the asymptotic properties of the estimators are established. The inference for the threshold parameter is obtained based on the CLS and CML score functions. Moreover, the Wald test is applied to detect the existence of the piecewise structure. Simulation studies are considered, along with an application: the number of criminal mischief incidents in the Pittsburgh dataset

本文基于二项式和负二项式稀疏算子,介绍了一种新的一阶混合整数值阈值自回归过程。本文讨论了该模型的基本概率和统计特性。推导出条件最小二乘法(CLS)和条件最大似然法(CML)估计器,并确定了估计器的渐近特性。根据 CLS 和 CML 分数函数推断出了阈值参数。此外,还应用 Wald 检验来检测是否存在片断结构。我们还考虑了模拟研究以及一个应用:匹兹堡数据集中的刑事恶作剧事件数量。
{"title":"A new First-Order mixture integer-valued threshold autoregressive process based on binomial thinning and negative binomial thinning","authors":"Danshu Sheng ,&nbsp;Dehui Wang ,&nbsp;Liuquan Sun","doi":"10.1016/j.jspi.2023.106143","DOIUrl":"10.1016/j.jspi.2023.106143","url":null,"abstract":"<div><p><span><span>In this paper, we introduce a new first-order mixture integer-valued threshold autoregressive process, based on the binomial and </span>negative binomial thinning operators. Basic probabilistic and statistical properties of this model are discussed. Conditional least squares (CLS) and conditional maximum likelihood (CML) estimators are derived and the </span>asymptotic properties<span> of the estimators are established. The inference for the threshold parameter is obtained based on the CLS and CML score functions<span>. Moreover, the Wald test is applied to detect the existence of the piecewise structure. Simulation studies are considered, along with an application: the number of criminal mischief incidents in the Pittsburgh dataset</span></span></p></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2023-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139070374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Designs for half-diallel experiments with commutative orthogonal block structure 采用换元正交块结构的半二列实验设计
IF 0.9 4区 数学 Q2 Mathematics Pub Date : 2023-12-21 DOI: 10.1016/j.jspi.2023.106139
R.A. Bailey, Peter J. Cameron, Dário Ferreira, Sandra S. Ferreira, Célia Nunes

In some experiments, the experimental units are all pairs of individuals who have to undertake a given task together. The set of such pairs forms a triangular association scheme. Appropriate randomization then gives two non-trivial strata. The design is said to have commutative orthogonal block structure (COBS) if the best linear unbiased estimators of treatment contrasts do not depend on the stratum variances. There are precisely three ways in which such a design can have COBS. We give a complete description of designs for which all treatment contrasts are in the same stratum. Then we give a very general construction for designs with COBS which have some treatment contrasts in each stratum.

在某些实验中,实验单元是必须共同完成某项任务的所有成对个体。这些配对的集合构成了一个三角形关联方案。适当的随机化可以得到两个非三角形的分层。如果处理对比的最佳线性无偏估计值不依赖于分层方差,则称该设计具有换元正交区组结构(COBS)。有三种方法可以使这种设计具有 COBS。我们将完整描述所有治疗对比都在同一分层的设计。然后,我们给出了具有 COBS 的设计的一般构造,即在每个分层中都有一些处理对比。
{"title":"Designs for half-diallel experiments with commutative orthogonal block structure","authors":"R.A. Bailey, Peter J. Cameron, Dário Ferreira, Sandra S. Ferreira, Célia Nunes","doi":"10.1016/j.jspi.2023.106139","DOIUrl":"https://doi.org/10.1016/j.jspi.2023.106139","url":null,"abstract":"<p>In some experiments, the experimental units are all pairs of individuals who have to undertake a given task together. The set of such pairs forms a triangular association scheme. Appropriate randomization then gives two non-trivial strata. The design is said to have commutative orthogonal block structure (COBS) if the best linear unbiased estimators of treatment contrasts do not depend on the stratum variances. There are precisely three ways in which such a design can have COBS. We give a complete description of designs for which all treatment contrasts are in the same stratum. Then we give a very general construction for designs with COBS which have some treatment contrasts in each stratum.</p>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2023-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139027909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Designs for half-diallel experiments with commutative orthogonal block structure 采用换元正交块结构的半二列实验设计
IF 0.9 4区 数学 Q2 Mathematics Pub Date : 2023-12-21 DOI: 10.1016/j.jspi.2023.106139
R.A. Bailey , Peter J. Cameron , Dário Ferreira , Sandra S. Ferreira , Célia Nunes

In some experiments, the experimental units are all pairs of individuals who have to undertake a given task together. The set of such pairs forms a triangular association scheme. Appropriate randomization then gives two non-trivial strata. The design is said to have commutative orthogonal block structure (COBS) if the best linear unbiased estimators of treatment contrasts do not depend on the stratum variances. There are precisely three ways in which such a design can have COBS. We give a complete description of designs for which all treatment contrasts are in the same stratum. Then we give a very general construction for designs with COBS which have some treatment contrasts in each stratum.

在某些实验中,实验单元是必须共同完成某项任务的所有成对个体。这些配对的集合构成了一个三角形关联方案。适当的随机化可以得到两个非三角形的分层。如果处理对比的最佳线性无偏估计值不依赖于分层方差,则称该设计具有换元正交区组结构(COBS)。有三种方法可以使这种设计具有 COBS。我们将完整描述所有治疗对比都在同一分层的设计。然后,我们给出了具有 COBS 的设计的一般构造,即在每个分层中都有一些处理对比。
{"title":"Designs for half-diallel experiments with commutative orthogonal block structure","authors":"R.A. Bailey ,&nbsp;Peter J. Cameron ,&nbsp;Dário Ferreira ,&nbsp;Sandra S. Ferreira ,&nbsp;Célia Nunes","doi":"10.1016/j.jspi.2023.106139","DOIUrl":"10.1016/j.jspi.2023.106139","url":null,"abstract":"<div><p>In some experiments, the experimental units are all pairs of individuals who have to undertake a given task together. The set of such pairs forms a triangular association scheme. Appropriate randomization then gives two non-trivial strata. The design is said to have commutative orthogonal block structure (COBS) if the best linear unbiased estimators of treatment contrasts do not depend on the stratum variances. There are precisely three ways in which such a design can have COBS. We give a complete description of designs for which all treatment contrasts are in the same stratum. Then we give a very general construction for designs with COBS which have some treatment contrasts in each stratum.</p></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2023-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0378375823001088/pdfft?md5=d9967380e48aaefeeec7c0b3f9545df6&pid=1-s2.0-S0378375823001088-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139016804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptively robust high-dimensional matrix factor analysis under Huber loss function 胡贝尔损失函数下的自适应鲁棒高维矩阵因子分析
IF 0.9 4区 数学 Q2 Mathematics Pub Date : 2023-12-20 DOI: 10.1016/j.jspi.2023.106137
Yinzhi Wang , Yingqiu Zhu , Qiang Sun , Lei Qin

The explosion of data volume and the expansion in data dimensionality have led to a critical challenge in analyzing high-dimensional matrix time series for big data-related applications. In this regard, factor models for matrix-valued high-dimensional time series provide a powerful tool, that reduces the dimensionality of the variables with low-rank structures. However, existing high-dimensional matrix factor models suffer from two limitations in complex scenarios. One is that it is difficult to make robust inferences for datasets with heavy-tailed distributions. The other is that existing models require additional parameters for fine-tuning to guarantee performance. We propose an adaptively robust high-dimensional matrix factor model based on a specified Huber loss function to tackle the challenges mentioned above. An efficient iterative algorithm is provided to consistently determine the additional parameters of our model for robust estimation. The robustness of the model estimation is greatly improved by incorporating the Huber loss. Furthermore, we theoretically investigate the proposed method and derive the convergence rates of the robust estimators to examine its performance. Simulations show that the proposed method outperforms previous models in the estimation of heavy-tailed data. A real-world data analysis on a financial portfolio dataset illustrates that the method can be used to extract useful knowledge from high-dimensional matrix time series.

数据量的爆炸式增长和数据维度的扩大,给大数据相关应用中的高维矩阵时间序列分析带来了严峻挑战。在这方面,用于矩阵值高维时间序列的因子模型提供了一个强大的工具,可以降低具有低秩结构的变量的维度。然而,现有的高维矩阵因子模型在复杂场景中存在两个局限性。一是难以对重尾分布的数据集进行稳健推断。另一个是现有模型需要额外的参数进行微调才能保证性能。我们提出了一种基于指定 Huber 损失函数的自适应稳健高维矩阵因子模型,以应对上述挑战。我们提供了一种高效的迭代算法,以持续确定模型的附加参数,从而实现稳健估计。加入 Huber 损失后,模型估计的稳健性大大提高。此外,我们从理论上研究了所提出的方法,并推导出稳健估计器的收敛率,以检验其性能。模拟结果表明,在重尾数据的估计中,所提出的方法优于之前的模型。对金融投资组合数据集的实际数据分析表明,该方法可用于从高维矩阵时间序列中提取有用的知识。
{"title":"Adaptively robust high-dimensional matrix factor analysis under Huber loss function","authors":"Yinzhi Wang ,&nbsp;Yingqiu Zhu ,&nbsp;Qiang Sun ,&nbsp;Lei Qin","doi":"10.1016/j.jspi.2023.106137","DOIUrl":"10.1016/j.jspi.2023.106137","url":null,"abstract":"<div><p>The explosion of data volume and the expansion in data dimensionality have led to a critical challenge in analyzing high-dimensional matrix time series for big data-related applications. In this regard, factor models for matrix-valued high-dimensional time series provide a powerful tool, that reduces the dimensionality of the variables with low-rank structures. However, existing high-dimensional matrix factor models suffer from two limitations in complex scenarios. One is that it is difficult to make robust inferences for datasets with heavy-tailed distributions. The other is that existing models require additional parameters for fine-tuning to guarantee performance. We propose an adaptively robust high-dimensional matrix factor model based on a specified Huber loss function to tackle the challenges mentioned above. An efficient iterative algorithm is provided to consistently determine the additional parameters of our model for robust estimation. The robustness of the model estimation is greatly improved by incorporating the Huber loss. Furthermore, we theoretically investigate the proposed method and derive the convergence rates of the robust estimators to examine its performance. Simulations show that the proposed method outperforms previous models in the estimation of heavy-tailed data. A real-world data analysis on a financial portfolio dataset illustrates that the method can be used to extract useful knowledge from high-dimensional matrix time series.</p></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2023-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138817793","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Kernel estimation of the transition density in bifurcating Markov chains 分叉马尔可夫链中过渡密度的核估计
IF 0.9 4区 数学 Q2 Mathematics Pub Date : 2023-12-20 DOI: 10.1016/j.jspi.2023.106138
S. Valère Bitseki Penda

We study the kernel estimators of the transition density of bifurcating Markov chains. Under some ergodic and regularity properties, we prove that these estimators are consistent and asymptotically normal. Next, in the numerical studies, we propose two data-driven methods to choose the bandwidth parameters. These methods, based on the so-called two bandwidths approach, are adaptation for bifurcating Markov chains of the least squares Cross-Validation and the rule of thumb method. Finally, we provide an example with real data.

我们研究了分叉马尔可夫链过渡密度的核估计量。在一些遍历和正则特性下,我们证明了这些估计值是一致和渐近正态的。接下来,在数值研究中,我们提出了两种数据驱动的带宽参数选择方法。这些方法基于所谓的双带宽方法,适用于最小二乘交叉验证法和经验法则法的分叉马尔可夫链。最后,我们提供了一个使用真实数据的示例。
{"title":"Kernel estimation of the transition density in bifurcating Markov chains","authors":"S. Valère Bitseki Penda","doi":"10.1016/j.jspi.2023.106138","DOIUrl":"10.1016/j.jspi.2023.106138","url":null,"abstract":"<div><p><span>We study the kernel estimators<span><span> of the transition density of bifurcating Markov chains. Under some ergodic and </span>regularity properties, we prove that these estimators are consistent and asymptotically normal. Next, in the </span></span>numerical studies, we propose two data-driven methods to choose the bandwidth parameters. These methods, based on the so-called two bandwidths approach, are adaptation for bifurcating Markov chains of the least squares Cross-Validation and the rule of thumb method. Finally, we provide an example with real data.</p></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2023-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139028106","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal subsampling for the Cox proportional hazards model with massive survival data 大量生存数据的考克斯比例危害模型的最佳子采样
IF 0.9 4区 数学 Q2 Mathematics Pub Date : 2023-12-19 DOI: 10.1016/j.jspi.2023.106136
Nan Qiao , Wangcheng Li , Feng Xiao , Cunjie Lin

Massive survival data has become common in survival analysis. In this study, a subsampling algorithm is proposed for Cox proportional hazards model with time-dependent covariates when the sample size is extraordinarily large but the computing resources are relatively limited. A subsample estimator is developed by maximizing a weighted partial likelihood, and shown to have consistency and asymptotic normality. By minimizing the asymptotic mean squared error of the subsample estimator, the optimal subsampling probabilities are formulated with explicit expression. Simulation studies show that the proposed method has satisfactory performances in approximating the full data estimator. The proposed method is applied to the corporate loan data and breast cancer data, with different censoring rates, and the outcome also confirms the practical advantages.

海量生存数据已成为生存分析中的常见现象。本研究提出了一种子采样算法,用于具有时间依赖协变量的 Cox 比例危险模型,当样本量超大但计算资源相对有限时。通过最大化加权部分似然建立了一个子样本估计器,并证明其具有一致性和渐近正态性。通过最小化子样本估计器的渐近均方误差,用明确的表达式提出了最优子样本概率。模拟研究表明,所提出的方法在逼近完整数据估计器方面具有令人满意的性能。该方法被应用于具有不同删失率的企业贷款数据和乳腺癌数据,结果也证实了其实用优势。
{"title":"Optimal subsampling for the Cox proportional hazards model with massive survival data","authors":"Nan Qiao ,&nbsp;Wangcheng Li ,&nbsp;Feng Xiao ,&nbsp;Cunjie Lin","doi":"10.1016/j.jspi.2023.106136","DOIUrl":"10.1016/j.jspi.2023.106136","url":null,"abstract":"<div><p><span><span>Massive survival data has become common in survival analysis. In this study, a subsampling algorithm is proposed for </span>Cox proportional hazards model with time-dependent </span>covariates<span> when the sample size is extraordinarily large but the computing resources are relatively limited. A subsample estimator is developed by maximizing a weighted partial likelihood, and shown to have consistency and asymptotic normality<span>. By minimizing the asymptotic mean squared error of the subsample estimator, the optimal subsampling probabilities are formulated with explicit expression. Simulation studies show that the proposed method has satisfactory performances in approximating the full data estimator. The proposed method is applied to the corporate loan data and breast cancer data, with different censoring rates, and the outcome also confirms the practical advantages.</span></span></p></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2023-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138817749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Regression analysis of longitudinal data with mixed synchronous and asynchronous longitudinal covariates 使用混合同步和非同步纵向协变量对纵向数据进行回归分析
IF 0.9 4区 数学 Q2 Mathematics Pub Date : 2023-12-09 DOI: 10.1016/j.jspi.2023.106135
Zhuowei Sun , Hongyuan Cao , Li Chen , Jason P. Fine

In linear models, omitting a covariate that is orthogonal to covariates in the model does not result in biased coefficient estimation. This generally does not hold for longitudinal data, where additional assumptions are needed to get an unbiased coefficient estimation in addition to the orthogonality between omitted longitudinal covariates and longitudinal covariates in the model. We propose methods to mitigate the omitted variable bias under weaker assumptions. A two-step estimation procedure is proposed to infer the asynchronous longitudinal covariates when such covariates are observed. For mixed synchronous and asynchronous longitudinal covariates, we get a parametric convergence rate for the coefficient estimation of the synchronous longitudinal covariates by the two-step method. Extensive simulation studies provide numerical support for the theoretical findings. We illustrate the performance of our method on a dataset from the Alzheimer’s Disease Neuroimaging Initiative study.

在线性模型中,省略一个与模型中协变量正交的协变量不会导致有偏差的系数估计。但纵向数据一般不存在这种情况,除了忽略的纵向协变量与模型中的纵向协变量之间的正交性之外,还需要额外的假设才能获得无偏的系数估计。我们提出了在较弱假设条件下减轻遗漏变量偏差的方法。我们提出了一个两步估计程序,用于在观测到非同步纵向协变量时推断此类协变量。对于混合同步和非同步纵向协变量,我们通过两步法得到了同步纵向协变量系数估计的参数收敛率。大量的模拟研究为理论结论提供了数值支持。我们在阿尔茨海默病神经影像倡议研究的数据集上说明了我们的方法的性能。
{"title":"Regression analysis of longitudinal data with mixed synchronous and asynchronous longitudinal covariates","authors":"Zhuowei Sun ,&nbsp;Hongyuan Cao ,&nbsp;Li Chen ,&nbsp;Jason P. Fine","doi":"10.1016/j.jspi.2023.106135","DOIUrl":"https://doi.org/10.1016/j.jspi.2023.106135","url":null,"abstract":"<div><p><span>In linear models, omitting a covariate<span><span> that is orthogonal to covariates in the model does not result in biased coefficient estimation. This generally does not hold for longitudinal data, where additional assumptions are needed to get an unbiased coefficient estimation in addition to the </span>orthogonality<span> between omitted longitudinal covariates and longitudinal covariates in the model. We propose methods to mitigate the omitted variable bias under weaker assumptions. A two-step estimation procedure is proposed to infer the asynchronous longitudinal covariates when such covariates are observed. For mixed synchronous and asynchronous longitudinal covariates, we get a </span></span></span>parametric convergence rate for the coefficient estimation of the synchronous longitudinal covariates by the two-step method. Extensive simulation studies provide numerical support for the theoretical findings. We illustrate the performance of our method on a dataset from the Alzheimer’s Disease Neuroimaging Initiative study.</p></div>","PeriodicalId":50039,"journal":{"name":"Journal of Statistical Planning and Inference","volume":null,"pages":null},"PeriodicalIF":0.9,"publicationDate":"2023-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138564584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Statistical Planning and Inference
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1