Julian Dörfler, Benito van der Zander, Markus Bläser, Maciej Liskiewicz
{"title":"On the Complexity of Identification in Linear Structural Causal Models","authors":"Julian Dörfler, Benito van der Zander, Markus Bläser, Maciej Liskiewicz","doi":"arxiv-2407.12528","DOIUrl":null,"url":null,"abstract":"Learning the unknown causal parameters of a linear structural causal model is\na fundamental task in causal analysis. The task, known as the problem of\nidentification, asks to estimate the parameters of the model from a combination\nof assumptions on the graphical structure of the model and observational data,\nrepresented as a non-causal covariance matrix. In this paper, we give a new\nsound and complete algorithm for generic identification which runs in\npolynomial space. By standard simulation results, this algorithm has\nexponential running time which vastly improves the state-of-the-art double\nexponential time method using a Gr\\\"obner basis approach. The paper also\npresents evidence that parameter identification is computationally hard in\ngeneral. In particular, we prove, that the task asking whether, for a given\nfeasible correlation matrix, there are exactly one or two or more parameter\nsets explaining the observed matrix, is hard for $\\forall R$, the co-class of\nthe existential theory of the reals. In particular, this problem is\n$coNP$-hard. To our best knowledge, this is the first hardness result for some\nnotion of identifiability.","PeriodicalId":501024,"journal":{"name":"arXiv - CS - Computational Complexity","volume":"95 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computational Complexity","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.12528","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Learning the unknown causal parameters of a linear structural causal model is
a fundamental task in causal analysis. The task, known as the problem of
identification, asks to estimate the parameters of the model from a combination
of assumptions on the graphical structure of the model and observational data,
represented as a non-causal covariance matrix. In this paper, we give a new
sound and complete algorithm for generic identification which runs in
polynomial space. By standard simulation results, this algorithm has
exponential running time which vastly improves the state-of-the-art double
exponential time method using a Gr\"obner basis approach. The paper also
presents evidence that parameter identification is computationally hard in
general. In particular, we prove, that the task asking whether, for a given
feasible correlation matrix, there are exactly one or two or more parameter
sets explaining the observed matrix, is hard for $\forall R$, the co-class of
the existential theory of the reals. In particular, this problem is
$coNP$-hard. To our best knowledge, this is the first hardness result for some
notion of identifiability.