Julian Dörfler, Benito van der Zander, Markus Bläser, Maciej Liskiewicz
{"title":"论线性结构因果模型识别的复杂性","authors":"Julian Dörfler, Benito van der Zander, Markus Bläser, Maciej Liskiewicz","doi":"arxiv-2407.12528","DOIUrl":null,"url":null,"abstract":"Learning the unknown causal parameters of a linear structural causal model is\na fundamental task in causal analysis. The task, known as the problem of\nidentification, asks to estimate the parameters of the model from a combination\nof assumptions on the graphical structure of the model and observational data,\nrepresented as a non-causal covariance matrix. In this paper, we give a new\nsound and complete algorithm for generic identification which runs in\npolynomial space. By standard simulation results, this algorithm has\nexponential running time which vastly improves the state-of-the-art double\nexponential time method using a Gr\\\"obner basis approach. The paper also\npresents evidence that parameter identification is computationally hard in\ngeneral. In particular, we prove, that the task asking whether, for a given\nfeasible correlation matrix, there are exactly one or two or more parameter\nsets explaining the observed matrix, is hard for $\\forall R$, the co-class of\nthe existential theory of the reals. In particular, this problem is\n$coNP$-hard. To our best knowledge, this is the first hardness result for some\nnotion of identifiability.","PeriodicalId":501024,"journal":{"name":"arXiv - CS - Computational Complexity","volume":"95 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"On the Complexity of Identification in Linear Structural Causal Models\",\"authors\":\"Julian Dörfler, Benito van der Zander, Markus Bläser, Maciej Liskiewicz\",\"doi\":\"arxiv-2407.12528\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Learning the unknown causal parameters of a linear structural causal model is\\na fundamental task in causal analysis. The task, known as the problem of\\nidentification, asks to estimate the parameters of the model from a combination\\nof assumptions on the graphical structure of the model and observational data,\\nrepresented as a non-causal covariance matrix. In this paper, we give a new\\nsound and complete algorithm for generic identification which runs in\\npolynomial space. By standard simulation results, this algorithm has\\nexponential running time which vastly improves the state-of-the-art double\\nexponential time method using a Gr\\\\\\\"obner basis approach. The paper also\\npresents evidence that parameter identification is computationally hard in\\ngeneral. In particular, we prove, that the task asking whether, for a given\\nfeasible correlation matrix, there are exactly one or two or more parameter\\nsets explaining the observed matrix, is hard for $\\\\forall R$, the co-class of\\nthe existential theory of the reals. In particular, this problem is\\n$coNP$-hard. To our best knowledge, this is the first hardness result for some\\nnotion of identifiability.\",\"PeriodicalId\":501024,\"journal\":{\"name\":\"arXiv - CS - Computational Complexity\",\"volume\":\"95 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Computational Complexity\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2407.12528\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computational Complexity","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.12528","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
On the Complexity of Identification in Linear Structural Causal Models
Learning the unknown causal parameters of a linear structural causal model is
a fundamental task in causal analysis. The task, known as the problem of
identification, asks to estimate the parameters of the model from a combination
of assumptions on the graphical structure of the model and observational data,
represented as a non-causal covariance matrix. In this paper, we give a new
sound and complete algorithm for generic identification which runs in
polynomial space. By standard simulation results, this algorithm has
exponential running time which vastly improves the state-of-the-art double
exponential time method using a Gr\"obner basis approach. The paper also
presents evidence that parameter identification is computationally hard in
general. In particular, we prove, that the task asking whether, for a given
feasible correlation matrix, there are exactly one or two or more parameter
sets explaining the observed matrix, is hard for $\forall R$, the co-class of
the existential theory of the reals. In particular, this problem is
$coNP$-hard. To our best knowledge, this is the first hardness result for some
notion of identifiability.