首页 > 最新文献

arXiv - STAT - Methodology最新文献

英文 中文
Poisson approximate likelihood compared to the particle filter 泊松近似似然与粒子滤波的比较
Pub Date : 2024-09-18 DOI: arxiv-2409.12173
Yize Hao, Aaron A. Abkemeier, Edward L. Ionides
Filtering algorithms are fundamental for inference on partially observedstochastic dynamic systems, since they provide access to the likelihoodfunction and hence enable likelihood-based or Bayesian inference. A novelPoisson approximate likelihood (PAL) filter was introduced by Whitehouse et al.(2023). PAL employs a Poisson approximation to conditional densities, offeringa fast approximation to the likelihood function for a certain subset ofpartially observed Markov process models. A central piece of evidence for PALis the comparison in Table 1 of Whitehouse et al. (2023), which claims a largeimprovement for PAL over a standard particle filter algorithm. This evidence,based on a model and data from a previous scientific study by Stocks et al.(2020), might suggest that researchers confronted with similar models shoulduse PAL rather than particle filter methods. Taken at face value, this evidencealso reduces the credibility of Stocks et al. (2020) by indicating ashortcoming with the numerical methods that they used. However, we show thatthe comparison of log-likelihood values made by Whitehouse et al. (2023) isflawed because their PAL calculations were carried out using a dataset scaleddifferently from the previous study. If PAL and the particle filter are appliedto the same data, the advantage claimed for PAL disappears. On simulationswhere the model is correctly specified, the particle filter outperforms PAL.
滤波算法是部分观测随机动态系统推断的基础,因为它们提供了对似然函数的访问,从而实现基于似然或贝叶斯的推断。怀特豪斯等人(2023 年)提出了一种新型泊松近似似然(PAL)滤波器。PAL 对条件密度采用泊松近似,为部分观测的马尔可夫过程模型的某些子集提供了快速近似似然函数。怀特豪斯等人(2023 年)在表 1 中对 PAL 进行了比较,认为 PAL 比标准粒子滤波算法有很大改进。这一证据基于 Stocks 等人(2020 年)以前的一项科学研究中的模型和数据,可能表明研究人员在面对类似模型时应使用 PAL 而不是粒子滤波方法。从表面价值来看,这一证据也降低了斯托克斯等人(2020 年)的可信度,因为它表明他们使用的数值方法存在缺陷。然而,我们发现怀特豪斯等人(2023 年)的对数似然值比较存在缺陷,因为他们的 PAL 计算使用的数据集比例与前一项研究不同。如果将 PAL 和粒子过滤器应用于相同的数据,那么 PAL 的优势就会消失。在正确指定模型的模拟中,粒子滤波器的性能优于 PAL。
{"title":"Poisson approximate likelihood compared to the particle filter","authors":"Yize Hao, Aaron A. Abkemeier, Edward L. Ionides","doi":"arxiv-2409.12173","DOIUrl":"https://doi.org/arxiv-2409.12173","url":null,"abstract":"Filtering algorithms are fundamental for inference on partially observed\u0000stochastic dynamic systems, since they provide access to the likelihood\u0000function and hence enable likelihood-based or Bayesian inference. A novel\u0000Poisson approximate likelihood (PAL) filter was introduced by Whitehouse et al.\u0000(2023). PAL employs a Poisson approximation to conditional densities, offering\u0000a fast approximation to the likelihood function for a certain subset of\u0000partially observed Markov process models. A central piece of evidence for PAL\u0000is the comparison in Table 1 of Whitehouse et al. (2023), which claims a large\u0000improvement for PAL over a standard particle filter algorithm. This evidence,\u0000based on a model and data from a previous scientific study by Stocks et al.\u0000(2020), might suggest that researchers confronted with similar models should\u0000use PAL rather than particle filter methods. Taken at face value, this evidence\u0000also reduces the credibility of Stocks et al. (2020) by indicating a\u0000shortcoming with the numerical methods that they used. However, we show that\u0000the comparison of log-likelihood values made by Whitehouse et al. (2023) is\u0000flawed because their PAL calculations were carried out using a dataset scaled\u0000differently from the previous study. If PAL and the particle filter are applied\u0000to the same data, the advantage claimed for PAL disappears. On simulations\u0000where the model is correctly specified, the particle filter outperforms PAL.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"44 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142256324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimising the Trade-Off Between Type I and Type II Errors: A Review and Extensions 优化 I 型和 II 型误差之间的权衡:回顾与扩展
Pub Date : 2024-09-18 DOI: arxiv-2409.12081
Andrew P Grieve
In clinical studies upon which decisions are based there are two types oferrors that can be made: a type I error arises when the decision is taken todeclare a positive outcome when the truth is in fact negative, and a type IIerror arises when the decision is taken to declare a negative outcome when thetruth is in fact positive. Commonly the primary analysis of such a studyentails a two-sided hypothesis test with a type I error rate of 5% and thestudy is designed to have a sufficiently low type II error rate, for example10% or 20%. These values are arbitrary and often do not reflect the clinical,or regulatory, context of the study and ignore both the relative costs ofmaking either type of error and the sponsor's prior belief that the drug issuperior to either placebo, or a standard of care if relevant. This simplisticapproach has recently been challenged by numerous authors both from afrequentist and Bayesian perspective since when resources are constrained therewill be a need to consider a trade-off between type I and type II errors. Inthis paper we review proposals to utilise the trade-off by formallyacknowledging the costs to optimise the choice of error rates for simple, pointnull and alternative hypotheses and extend the results to composite, orinterval hypotheses, showing links to the Probability of Success of a clinicalstudy.
在作为决策依据的临床研究中,可能会出现两种类型的错误:当决定宣布阳性结果时,如果真相实际上是阴性,就会出现 I 型错误;当决定宣布阴性结果时,如果真相实际上是阳性,就会出现 II 型错误。通常情况下,此类研究的初步分析需要进行双侧假设检验,I 型错误率为 5%,研究的 II 型错误率要足够低,例如 10%或 20%。这些数值都是任意设定的,通常不能反映研究的临床或监管背景,而且忽略了出现任一类型错误的相对成本,以及申办者事先认为药物优于安慰剂或相关护理标准的信念。这种简单化的方法最近受到了许多学者的质疑,无论是从频繁论者还是贝叶斯论者的角度来看,因为当资源有限时,就需要考虑 I 类和 II 类错误之间的权衡。在本文中,我们回顾了利用这种权衡的建议,即通过正式确认成本来优化简单假说、点空假说和替代假说的错误率选择,并将结果扩展到复合假说或区间假说,显示与临床研究成功概率的联系。
{"title":"Optimising the Trade-Off Between Type I and Type II Errors: A Review and Extensions","authors":"Andrew P Grieve","doi":"arxiv-2409.12081","DOIUrl":"https://doi.org/arxiv-2409.12081","url":null,"abstract":"In clinical studies upon which decisions are based there are two types of\u0000errors that can be made: a type I error arises when the decision is taken to\u0000declare a positive outcome when the truth is in fact negative, and a type II\u0000error arises when the decision is taken to declare a negative outcome when the\u0000truth is in fact positive. Commonly the primary analysis of such a study\u0000entails a two-sided hypothesis test with a type I error rate of 5% and the\u0000study is designed to have a sufficiently low type II error rate, for example\u000010% or 20%. These values are arbitrary and often do not reflect the clinical,\u0000or regulatory, context of the study and ignore both the relative costs of\u0000making either type of error and the sponsor's prior belief that the drug is\u0000superior to either placebo, or a standard of care if relevant. This simplistic\u0000approach has recently been challenged by numerous authors both from a\u0000frequentist and Bayesian perspective since when resources are constrained there\u0000will be a need to consider a trade-off between type I and type II errors. In\u0000this paper we review proposals to utilise the trade-off by formally\u0000acknowledging the costs to optimise the choice of error rates for simple, point\u0000null and alternative hypotheses and extend the results to composite, or\u0000interval hypotheses, showing links to the Probability of Success of a clinical\u0000study.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"123 14 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142256370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bias Reduction in Matched Observational Studies with Continuous Treatments: Calipered Non-Bipartite Matching and Bias-Corrected Estimation and Inference 在连续治疗的匹配观察研究中减少偏差:Calipered Non-Bipartite Matching 和偏差校正估计与推论
Pub Date : 2024-09-18 DOI: arxiv-2409.11701
Anthony Frazier, Siyu Heng, Wen Zhou
Matching is a commonly used causal inference framework in observationalstudies. By pairing individuals with different treatment values but with thesame values of covariates (i.e., exact matching), the sample average treatmenteffect (SATE) can be consistently estimated and inferred using the classicNeyman-type (difference-in-means) estimator and confidence interval. However,inexact matching typically exists in practice and may cause substantial biasfor the downstream treatment effect estimation and inference. Many methods havebeen proposed to reduce bias due to inexact matching in the binary treatmentcase. However, to our knowledge, no existing work has systematicallyinvestigated bias due to inexact matching in the continuous treatment case. Tofill this blank, we propose a general framework for reducing bias in inexactlymatched observational studies with continuous treatments. In the matchingstage, we propose a carefully formulated caliper that incorporates theinformation of both the paired covariates and treatment doses to better tailormatching for the downstream SATE estimation and inference. In the estimationand inference stage, we propose a bias-corrected Neyman estimator paired withthe corresponding bias-corrected variance estimator to leverage the informationon propensity density discrepancies after inexact matching to further reducethe bias due to inexact matching. We apply our proposed framework to COVID-19social mobility data to showcase differences between classic and bias-correctedSATE estimation and inference.
匹配是观察性研究中常用的因果推断框架。通过将治疗值不同但协变量值相同的个体配对(即精确配对),可以使用经典的内曼式(均值差)估计器和置信区间对样本平均治疗效果(SATE)进行一致的估计和推断。然而,在实践中通常存在不完全匹配的情况,这可能会对下游治疗效果的估计和推断造成很大偏差。已经提出了许多方法来减少二元治疗案例中的非精确匹配造成的偏差。然而,据我们所知,目前还没有任何工作系统地研究过连续治疗情况下的非精确匹配导致的偏差。为了填补这一空白,我们提出了一个减少连续治疗非精确匹配观察研究偏差的总体框架。在匹配阶段,我们提出了一个精心制定的卡尺,将配对协变量和治疗剂量的信息纳入其中,为下游的 SATE 估计和推断提供更好的尾匹配。在估计和推断阶段,我们提出了一个偏差校正的内曼估计器,并配以相应的偏差校正方差估计器,以利用不完全匹配后的倾向密度差异信息,进一步减少不完全匹配导致的偏差。我们将提出的框架应用于 COVID-19 社会流动性数据,以展示经典和偏差校正 SATE 估计和推断之间的差异。
{"title":"Bias Reduction in Matched Observational Studies with Continuous Treatments: Calipered Non-Bipartite Matching and Bias-Corrected Estimation and Inference","authors":"Anthony Frazier, Siyu Heng, Wen Zhou","doi":"arxiv-2409.11701","DOIUrl":"https://doi.org/arxiv-2409.11701","url":null,"abstract":"Matching is a commonly used causal inference framework in observational\u0000studies. By pairing individuals with different treatment values but with the\u0000same values of covariates (i.e., exact matching), the sample average treatment\u0000effect (SATE) can be consistently estimated and inferred using the classic\u0000Neyman-type (difference-in-means) estimator and confidence interval. However,\u0000inexact matching typically exists in practice and may cause substantial bias\u0000for the downstream treatment effect estimation and inference. Many methods have\u0000been proposed to reduce bias due to inexact matching in the binary treatment\u0000case. However, to our knowledge, no existing work has systematically\u0000investigated bias due to inexact matching in the continuous treatment case. To\u0000fill this blank, we propose a general framework for reducing bias in inexactly\u0000matched observational studies with continuous treatments. In the matching\u0000stage, we propose a carefully formulated caliper that incorporates the\u0000information of both the paired covariates and treatment doses to better tailor\u0000matching for the downstream SATE estimation and inference. In the estimation\u0000and inference stage, we propose a bias-corrected Neyman estimator paired with\u0000the corresponding bias-corrected variance estimator to leverage the information\u0000on propensity density discrepancies after inexact matching to further reduce\u0000the bias due to inexact matching. We apply our proposed framework to COVID-19\u0000social mobility data to showcase differences between classic and bias-corrected\u0000SATE estimation and inference.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"17 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142256371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forecasting age distribution of life-table death counts via α-transformation 通过α变换预测生命表死亡人数的年龄分布
Pub Date : 2024-09-18 DOI: arxiv-2409.11658
Han Lin Shang, Steven Haberman
We introduce a compositional power transformation, known as an{alpha}-transformation, to model and forecast a time series of life-tabledeath counts, possibly with zero counts observed at older ages. As ageneralisation of the isometric log-ratio transformation (i.e., {alpha} = 0),the {alpha} transformation relies on the tuning parameter {alpha}, which canbe determined in a data-driven manner. Using the Australian age-specific periodlife-table death counts from 1921 to 2020, the {alpha} transformation canproduce more accurate short-term point and interval forecasts than thelog-ratio transformation. The improved forecast accuracy of life-table deathcounts is of great importance to demographers and government planners forestimating survival probabilities and life expectancy and actuaries fordetermining annuity prices and reserves for various initial ages and maturityterms.
我们引入了一种被称为{alpha}变换的组成幂变换,来模拟和预测生命-死亡计数的时间序列,其中可能在较大年龄段观测到零计数。作为等距对数比率变换(即{alpha} = 0)的一般化,{alpha}变换依赖于{alpha}调谐参数,该参数可以通过数据驱动的方式确定。利用澳大利亚从 1921 年到 2020 年特定年龄段的生命表死亡人数,{alpha}变换可以产生比对数变换更准确的短期点预测和区间预测。生命表死亡人数预测准确性的提高,对于人口学家和政府规划人员估计生存概率和预期寿命,以及精算师确定不同初始年龄和成熟期的年金价格和储备金,都具有重要意义。
{"title":"Forecasting age distribution of life-table death counts via α-transformation","authors":"Han Lin Shang, Steven Haberman","doi":"arxiv-2409.11658","DOIUrl":"https://doi.org/arxiv-2409.11658","url":null,"abstract":"We introduce a compositional power transformation, known as an\u0000{alpha}-transformation, to model and forecast a time series of life-table\u0000death counts, possibly with zero counts observed at older ages. As a\u0000generalisation of the isometric log-ratio transformation (i.e., {alpha} = 0),\u0000the {alpha} transformation relies on the tuning parameter {alpha}, which can\u0000be determined in a data-driven manner. Using the Australian age-specific period\u0000life-table death counts from 1921 to 2020, the {alpha} transformation can\u0000produce more accurate short-term point and interval forecasts than the\u0000log-ratio transformation. The improved forecast accuracy of life-table death\u0000counts is of great importance to demographers and government planners for\u0000estimating survival probabilities and life expectancy and actuaries for\u0000determining annuity prices and reserves for various initial ages and maturity\u0000terms.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"7 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142256372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
E-Values for Exponential Families: the General Case 指数族的 E 值:一般情况
Pub Date : 2024-09-17 DOI: arxiv-2409.11134
Yunda Hao, Peter Grünwald
We analyze common types of e-variables and e-processes for compositeexponential family nulls: the optimal e-variable based on the reverseinformation projection (RIPr), the conditional (COND) e-variable, and theuniversal inference (UI) and sequen-tialized RIPr e-processes. We characterizethe RIPr prior for simple and Bayes-mixture based alternatives, eitherprecisely (for Gaussian nulls and alternatives) or in an approximate sense(general exponential families). We provide conditions under which the RIPre-variable is (again exactly vs. approximately) equal to the COND e-variable.Based on these and other interrelations which we establish, we determine thee-power of the four e-statistics as a function of sample size, exactly forGaussian and up to $o(1)$ in general. For $d$-dimensional null and alternative,the e-power of UI tends to be smaller by a term of $(d/2) log n + O(1)$ thanthat of the COND e-variable, which is the clear winner.
我们分析了复合指数族空值的常见电子变量和电子过程类型:基于反向信息投影(RIPr)的最优电子变量、条件(COND)电子变量、通用推理(UI)和序列化 RIPr 电子过程。我们描述了基于简单和贝叶斯混合物的 RIPr 先验的特征,无论是精确的(高斯空值和替代变量)还是近似意义上的(一般指数族)。我们提供了 RIPre 变量与 COND e 变量(同样是精确与近似)相等的条件。基于我们建立的这些及其他相互关系,我们确定了四个 e 统计量与样本量的函数关系,精确用于高斯,一般可达 $o(1)$。对于 $d$ 维的 null 和 alternative,UI 的 e-power 往往比 COND e-variable 的 e-power 小 $(d/2) log n + O(1)$,后者是明显的赢家。
{"title":"E-Values for Exponential Families: the General Case","authors":"Yunda Hao, Peter Grünwald","doi":"arxiv-2409.11134","DOIUrl":"https://doi.org/arxiv-2409.11134","url":null,"abstract":"We analyze common types of e-variables and e-processes for composite\u0000exponential family nulls: the optimal e-variable based on the reverse\u0000information projection (RIPr), the conditional (COND) e-variable, and the\u0000universal inference (UI) and sequen-tialized RIPr e-processes. We characterize\u0000the RIPr prior for simple and Bayes-mixture based alternatives, either\u0000precisely (for Gaussian nulls and alternatives) or in an approximate sense\u0000(general exponential families). We provide conditions under which the RIPr\u0000e-variable is (again exactly vs. approximately) equal to the COND e-variable.\u0000Based on these and other interrelations which we establish, we determine the\u0000e-power of the four e-statistics as a function of sample size, exactly for\u0000Gaussian and up to $o(1)$ in general. For $d$-dimensional null and alternative,\u0000the e-power of UI tends to be smaller by a term of $(d/2) log n + O(1)$ than\u0000that of the COND e-variable, which is the clear winner.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"16 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142256374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cointegrated Matrix Autoregression Models 协整矩阵自回归模型
Pub Date : 2024-09-17 DOI: arxiv-2409.10860
Zebang Li, Han Xiao
We propose a novel cointegrated autoregressive model for matrix-valued timeseries, with bi-linear cointegrating vectors corresponding to the rows andcolumns of the matrix data. Compared to the traditional cointegration analysis,our proposed matrix cointegration model better preserves the inherent structureof the data and enables corresponding interpretations. To estimate thecointegrating vectors as well as other coefficients, we introduce two types ofestimators based on least squares and maximum likelihood. We investigate theasymptotic properties of the cointegrated matrix autoregressive model under theexistence of trend and establish the asymptotic distributions for thecointegrating vectors, as well as other model parameters. We conduct extensivesimulations to demonstrate its superior performance over traditional methods.In addition, we apply our proposed model to Fama-French portfolios and developa effective pairs trading strategy.
我们为矩阵值时间序列提出了一种新的协整自回归模型,其双线性协整向量与矩阵数据的行和列相对应。与传统的协整分析相比,我们提出的矩阵协整模型更好地保留了数据的内在结构,并能进行相应的解释。为了估计协整向量和其他系数,我们引入了基于最小二乘法和最大似然法的两种估计方法。我们研究了趋势存在下协整矩阵自回归模型的渐近特性,并建立了协整向量以及其他模型参数的渐近分布。此外,我们还将提出的模型应用于法玛-法式投资组合,并开发了有效的配对交易策略。
{"title":"Cointegrated Matrix Autoregression Models","authors":"Zebang Li, Han Xiao","doi":"arxiv-2409.10860","DOIUrl":"https://doi.org/arxiv-2409.10860","url":null,"abstract":"We propose a novel cointegrated autoregressive model for matrix-valued time\u0000series, with bi-linear cointegrating vectors corresponding to the rows and\u0000columns of the matrix data. Compared to the traditional cointegration analysis,\u0000our proposed matrix cointegration model better preserves the inherent structure\u0000of the data and enables corresponding interpretations. To estimate the\u0000cointegrating vectors as well as other coefficients, we introduce two types of\u0000estimators based on least squares and maximum likelihood. We investigate the\u0000asymptotic properties of the cointegrated matrix autoregressive model under the\u0000existence of trend and establish the asymptotic distributions for the\u0000cointegrating vectors, as well as other model parameters. We conduct extensive\u0000simulations to demonstrate its superior performance over traditional methods.\u0000In addition, we apply our proposed model to Fama-French portfolios and develop\u0000a effective pairs trading strategy.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142256378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Calibrated Multivariate Regression with Localized PIT Mappings 使用本地化 PIT 映射的校准多元回归
Pub Date : 2024-09-17 DOI: arxiv-2409.10855
Lucas Kock, G. S. Rodrigues, Scott A. Sisson, Nadja Klein, David J. Nott
Calibration ensures that predicted uncertainties align with observeduncertainties. While there is an extensive literature on recalibration methodsfor univariate probabilistic forecasts, work on calibration for multivariateforecasts is much more limited. This paper introduces a novel post-hocrecalibration approach that addresses multivariate calibration for potentiallymisspecified models. Our method involves constructing local mappings betweenvectors of marginal probability integral transform values and the space ofobservations, providing a flexible and model free solution applicable tocontinuous, discrete, and mixed responses. We present two versions of ourapproach: one uses K-nearest neighbors, and the other uses normalizing flows.Each method has its own strengths in different situations. We demonstrate theeffectiveness of our approach on two real data applications: recalibrating adeep neural network's currency exchange rate forecast and improving aregression model for childhood malnutrition in India for which the multivariateresponse has both discrete and continuous components.
校准可确保预测的不确定性与观测到的不确定性相一致。关于单变量概率预测的重新校准方法已有大量文献,但关于多变量预测的校准工作则有限得多。本文介绍了一种新颖的事后重新校准方法,可解决潜在不确定模型的多变量校准问题。我们的方法涉及在边际概率积分变换值向量和观测空间之间构建局部映射,提供一种灵活的、不受模型限制的解决方案,适用于连续、离散和混合响应。我们介绍了我们方法的两个版本:一个使用 K 最近邻,另一个使用归一化流。我们在两个实际数据应用中展示了我们方法的有效性:重新校准深度神经网络的汇率预测,以及改进印度儿童营养不良的回归模型,其中多变量响应既有离散成分,也有连续成分。
{"title":"Calibrated Multivariate Regression with Localized PIT Mappings","authors":"Lucas Kock, G. S. Rodrigues, Scott A. Sisson, Nadja Klein, David J. Nott","doi":"arxiv-2409.10855","DOIUrl":"https://doi.org/arxiv-2409.10855","url":null,"abstract":"Calibration ensures that predicted uncertainties align with observed\u0000uncertainties. While there is an extensive literature on recalibration methods\u0000for univariate probabilistic forecasts, work on calibration for multivariate\u0000forecasts is much more limited. This paper introduces a novel post-hoc\u0000recalibration approach that addresses multivariate calibration for potentially\u0000misspecified models. Our method involves constructing local mappings between\u0000vectors of marginal probability integral transform values and the space of\u0000observations, providing a flexible and model free solution applicable to\u0000continuous, discrete, and mixed responses. We present two versions of our\u0000approach: one uses K-nearest neighbors, and the other uses normalizing flows.\u0000Each method has its own strengths in different situations. We demonstrate the\u0000effectiveness of our approach on two real data applications: recalibrating a\u0000deep neural network's currency exchange rate forecast and improving a\u0000regression model for childhood malnutrition in India for which the multivariate\u0000response has both discrete and continuous components.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"44 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142256377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of g-estimation approaches for handling symptomatic medication at multiple timepoints in Alzheimer's Disease with a hypothetical strategy 阿尔茨海默病多时间点症状用药的 g 值估算方法与假设策略的比较
Pub Date : 2024-09-17 DOI: arxiv-2409.10943
Florian Lasch, Lorenzo Guizzaro, Wen Wei Loh
For handling intercurrent events in clinical trials, one of the strategiesoutlined in the ICH E9(R1) addendum targets the hypothetical scenario ofnon-occurrence of the intercurrent event. While this strategy is oftenimplemented by setting data after the intercurrent event to missing even ifthey have been collected, g-estimation allows for a more efficient estimationby using the information contained in post-IE data. As the g-estimation methodshave largely developed outside of randomised clinical trials, optimisations forthe application in clinical trials are possible. In this work, we describe andinvestigate the performance of modifications to the established g-estimationmethods, leveraging the assumption that some intercurrent events are expectedto have the same impact on the outcome regardless of the timing of theiroccurrence. In a simulation study in Alzheimer disease, the modifications showa substantial efficiency advantage for the estimation of an estimand thatapplies the hypothetical strategy to the use of symptomatic treatment whileretaining unbiasedness and adequate type I error control.
在处理临床试验中的并发症时,ICH E9(R1)附录中列出的策略之一是针对并发症不发生的假设情况。实施这一策略的方法通常是将并发症发生后的数据设为缺失(即使已经收集到),而 g 估计法可以利用并发症发生后数据中包含的信息进行更有效的估计。由于 g 估计方法主要是在随机临床试验之外发展起来的,因此有可能在临床试验中进行优化应用。在这项工作中,我们描述并研究了对已建立的 g 估计方法进行修改后的性能,这些修改利用了这样一个假设,即无论发生的时间如何,一些并发症都会对结果产生相同的影响。在一项关于阿尔茨海默病的模拟研究中,修改后的方法在估算将假设策略应用于对症治疗的估算值时显示出巨大的效率优势,同时保持了无偏性和充分的 I 型误差控制。
{"title":"Comparison of g-estimation approaches for handling symptomatic medication at multiple timepoints in Alzheimer's Disease with a hypothetical strategy","authors":"Florian Lasch, Lorenzo Guizzaro, Wen Wei Loh","doi":"arxiv-2409.10943","DOIUrl":"https://doi.org/arxiv-2409.10943","url":null,"abstract":"For handling intercurrent events in clinical trials, one of the strategies\u0000outlined in the ICH E9(R1) addendum targets the hypothetical scenario of\u0000non-occurrence of the intercurrent event. While this strategy is often\u0000implemented by setting data after the intercurrent event to missing even if\u0000they have been collected, g-estimation allows for a more efficient estimation\u0000by using the information contained in post-IE data. As the g-estimation methods\u0000have largely developed outside of randomised clinical trials, optimisations for\u0000the application in clinical trials are possible. In this work, we describe and\u0000investigate the performance of modifications to the established g-estimation\u0000methods, leveraging the assumption that some intercurrent events are expected\u0000to have the same impact on the outcome regardless of the timing of their\u0000occurrence. In a simulation study in Alzheimer disease, the modifications show\u0000a substantial efficiency advantage for the estimation of an estimand that\u0000applies the hypothetical strategy to the use of symptomatic treatment while\u0000retaining unbiasedness and adequate type I error control.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142256376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data-driven stochastic 3D modeling of the nanoporous binder-conductive additive phase in battery cathodes 电池阴极中纳米多孔粘合剂导电添加剂相的数据驱动随机三维建模
Pub Date : 2024-09-17 DOI: arxiv-2409.11080
Phillip Gräfensteiner, Markus Osenberg, André Hilger, Nicole Bohn, Joachim R. Binder, Ingo Manke, Volker Schmidt, Matthias Neumann
A stochastic 3D modeling approach for the nanoporous binder-conductiveadditive phase in hierarchically structured cathodes of lithium-ion batteriesis presented. The binder-conductive additive phase of these electrodes consistsof carbon black, polyvinylidene difluoride binder and graphite particles. Forits stochastic 3D modeling, a three-step procedure based on methods fromstochastic geometry is used. First, the graphite particles are described by aBoolean model with ellipsoidal grains. Second, the mixture of carbon black andbinder is modeled by an excursion set of a Gaussian random field in thecomplement of the graphite particles. Third, large pore regions within themixture of carbon black and binder are described by a Boolean model withspherical grains. The model parameters are calibrated to 3D image data ofcathodes in lithium-ion batteries acquired by focused ion beam scanningelectron microscopy. Subsequently, model validation is performed by comparingmodel realizations with measured image data in terms of various morphologicaldescriptors that are not used for model fitting. Finally, we use the stochastic3D model for predictive simulations, where we generate virtual, yet realistic,image data of nanoporous binder-conductive additives with varying amounts ofgraphite particles. Based on these virtual nanostructures, we can investigatestructure-property relationships. In particular, we quantitatively study theinfluence of graphite particles on effective transport properties in thenanoporous binder-conductive additive phase, which have a crucial impact onelectrochemical processes in the cathode and thus on the performance of batterycells.
本文介绍了锂离子电池分层结构阴极中纳米多孔粘结导电添加相的随机三维建模方法。这些电极的粘结导电添加相由炭黑、聚偏二氟乙烯粘结剂和石墨颗粒组成。为了对其进行随机三维建模,采用了基于随机几何方法的三步程序。首先,用椭圆形颗粒的布尔模型来描述石墨颗粒。其次,炭黑和粘合剂的混合物由石墨颗粒补充部分的高斯随机场偏移集建模。第三,炭黑和粘合剂混合物中的大孔隙区域由球形颗粒的布尔模型描述。模型参数根据聚焦离子束扫描电子显微镜获取的锂离子电池阴极三维图像数据进行校准。随后,通过比较模型实现值与测量图像数据,对模型进行验证,这些数据包含模型拟合时未使用的各种形态描述符。最后,我们使用随机 3D 模型进行预测模拟,生成具有不同数量石墨颗粒的纳米多孔粘结导电添加剂的虚拟但真实的图像数据。基于这些虚拟纳米结构,我们可以研究结构与性能之间的关系。特别是,我们定量研究了石墨颗粒对纳米多孔粘结剂导电添加剂相中有效传输特性的影响,这些特性对阴极的电化学过程以及电池的性能有着至关重要的影响。
{"title":"Data-driven stochastic 3D modeling of the nanoporous binder-conductive additive phase in battery cathodes","authors":"Phillip Gräfensteiner, Markus Osenberg, André Hilger, Nicole Bohn, Joachim R. Binder, Ingo Manke, Volker Schmidt, Matthias Neumann","doi":"arxiv-2409.11080","DOIUrl":"https://doi.org/arxiv-2409.11080","url":null,"abstract":"A stochastic 3D modeling approach for the nanoporous binder-conductive\u0000additive phase in hierarchically structured cathodes of lithium-ion batteries\u0000is presented. The binder-conductive additive phase of these electrodes consists\u0000of carbon black, polyvinylidene difluoride binder and graphite particles. For\u0000its stochastic 3D modeling, a three-step procedure based on methods from\u0000stochastic geometry is used. First, the graphite particles are described by a\u0000Boolean model with ellipsoidal grains. Second, the mixture of carbon black and\u0000binder is modeled by an excursion set of a Gaussian random field in the\u0000complement of the graphite particles. Third, large pore regions within the\u0000mixture of carbon black and binder are described by a Boolean model with\u0000spherical grains. The model parameters are calibrated to 3D image data of\u0000cathodes in lithium-ion batteries acquired by focused ion beam scanning\u0000electron microscopy. Subsequently, model validation is performed by comparing\u0000model realizations with measured image data in terms of various morphological\u0000descriptors that are not used for model fitting. Finally, we use the stochastic\u00003D model for predictive simulations, where we generate virtual, yet realistic,\u0000image data of nanoporous binder-conductive additives with varying amounts of\u0000graphite particles. Based on these virtual nanostructures, we can investigate\u0000structure-property relationships. In particular, we quantitatively study the\u0000influence of graphite particles on effective transport properties in the\u0000nanoporous binder-conductive additive phase, which have a crucial impact on\u0000electrochemical processes in the cathode and thus on the performance of battery\u0000cells.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"187 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142256384","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Chasing Shadows: How Implausible Assumptions Skew Our Understanding of Causal Estimands 追逐阴影:匪夷所思的假设如何歪曲我们对因果估计的理解
Pub Date : 2024-09-17 DOI: arxiv-2409.11162
Stijn Vansteelandt, Kelly Van Lancker
The ICH E9 (R1) addendum on estimands, coupled with recent advancements incausal inference, has prompted a shift towards using model-free treatmenteffect estimands that are more closely aligned with the underlying scientificquestion. This represents a departure from traditional, model-dependentapproaches where the statistical model often overshadows the inquiry itself.While this shift is a positive development, it has unintentionally led to theprioritization of an estimand's theoretical appeal over its practicallearnability from data under plausible assumptions. We illustrate this byscrutinizing assumptions in the recent clinical trials literature on principalstratum estimands, demonstrating that some popular assumptions are not onlyimplausible but often inevitably violated. We advocate for a more balancedapproach to estimand formulation, one that carefully considers both thescientific relevance and the practical feasibility of estimation underrealistic conditions.
ICH E9 (R1)关于估计值的附录,加上最近因果推断方面的进步,促使人们转向使用与基本科学问题更密切相关的无模型治疗效果估计值。虽然这种转变是一种积极的发展,但它无意中导致了估算指标的理论吸引力优先于其在合理假设下从数据中的实际可学习性。我们通过对近期临床试验文献中有关本底估计值的假设进行细分来说明这一点,证明一些流行的假设不仅不合理,而且经常不可避免地遭到违反。我们主张采用更加平衡的方法来制定估计值,即在现实条件下仔细考虑估计值的科学相关性和实际可行性。
{"title":"Chasing Shadows: How Implausible Assumptions Skew Our Understanding of Causal Estimands","authors":"Stijn Vansteelandt, Kelly Van Lancker","doi":"arxiv-2409.11162","DOIUrl":"https://doi.org/arxiv-2409.11162","url":null,"abstract":"The ICH E9 (R1) addendum on estimands, coupled with recent advancements in\u0000causal inference, has prompted a shift towards using model-free treatment\u0000effect estimands that are more closely aligned with the underlying scientific\u0000question. This represents a departure from traditional, model-dependent\u0000approaches where the statistical model often overshadows the inquiry itself.\u0000While this shift is a positive development, it has unintentionally led to the\u0000prioritization of an estimand's theoretical appeal over its practical\u0000learnability from data under plausible assumptions. We illustrate this by\u0000scrutinizing assumptions in the recent clinical trials literature on principal\u0000stratum estimands, demonstrating that some popular assumptions are not only\u0000implausible but often inevitably violated. We advocate for a more balanced\u0000approach to estimand formulation, one that carefully considers both the\u0000scientific relevance and the practical feasibility of estimation under\u0000realistic conditions.","PeriodicalId":501425,"journal":{"name":"arXiv - STAT - Methodology","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142256375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
arXiv - STAT - Methodology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1