Christian B. H. Thorjussen, K. H. Liland, Ingrid Måge, Lars Erik Solberg
{"title":"Computational Test for Conditional Independence","authors":"Christian B. H. Thorjussen, K. H. Liland, Ingrid Måge, Lars Erik Solberg","doi":"10.3390/a17080323","DOIUrl":null,"url":null,"abstract":"Conditional Independence (CI) testing is fundamental in statistical analysis. For example, CI testing helps validate causal graphs or longitudinal data analysis with repeated measures in causal inference. CI testing is difficult, especially when testing involves categorical variables conditioned on a mixture of continuous and categorical variables. Current parametric and non-parametric testing methods are designed for continuous variables and can quickly fall short in the categorical case. This paper presents a computational approach for CI testing suited for categorical data types, which we call computational conditional independence (CCI) testing. The test procedure is based on permutation and combines machine learning prediction algorithms and Monte Carlo cross-validation. We evaluated the approach through simulation studies and assessed the performance against alternative methods: the generalized covariance measure test, the kernel conditional independence test, and testing with multinomial regression. We find that the computational approach to testing has utility over the alternative methods, achieving better control over type I error rates. We hope this work can expand the toolkit for CI testing for practitioners and researchers.","PeriodicalId":502609,"journal":{"name":"Algorithms","volume":"14 5","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Algorithms","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/a17080323","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Conditional Independence (CI) testing is fundamental in statistical analysis. For example, CI testing helps validate causal graphs or longitudinal data analysis with repeated measures in causal inference. CI testing is difficult, especially when testing involves categorical variables conditioned on a mixture of continuous and categorical variables. Current parametric and non-parametric testing methods are designed for continuous variables and can quickly fall short in the categorical case. This paper presents a computational approach for CI testing suited for categorical data types, which we call computational conditional independence (CCI) testing. The test procedure is based on permutation and combines machine learning prediction algorithms and Monte Carlo cross-validation. We evaluated the approach through simulation studies and assessed the performance against alternative methods: the generalized covariance measure test, the kernel conditional independence test, and testing with multinomial regression. We find that the computational approach to testing has utility over the alternative methods, achieving better control over type I error rates. We hope this work can expand the toolkit for CI testing for practitioners and researchers.
条件独立性(CI)测试是统计分析的基础。例如,CI 检验有助于验证因果图或因果推断中使用重复测量的纵向数据分析。CI 检验很困难,尤其是当检验涉及以连续变量和分类变量混合为条件的分类变量时。目前的参数和非参数测试方法是为连续变量设计的,在分类情况下很快就会失效。本文提出了一种适用于分类数据类型的 CI 检验计算方法,我们称之为计算条件独立性(CCI)检验。该测试程序基于置换,并结合了机器学习预测算法和蒙特卡罗交叉验证。我们通过模拟研究对该方法进行了评估,并对照其他方法评估了其性能:广义协方差测量检验、核条件独立性检验以及多项式回归检验。我们发现,计算检验方法比其他方法更有用,能更好地控制 I 类错误率。我们希望这项工作能为从业人员和研究人员扩展 CI 检验工具包。