ZuhengDavid, Xu, Moksh Jain, Ali Denton, Shawn Whitfield, Aniket Didolkar, Berton Earnshaw, Jason Hartford
{"title":"Automated Discovery of Pairwise Interactions from Unstructured Data","authors":"ZuhengDavid, Xu, Moksh Jain, Ali Denton, Shawn Whitfield, Aniket Didolkar, Berton Earnshaw, Jason Hartford","doi":"arxiv-2409.07594","DOIUrl":null,"url":null,"abstract":"Pairwise interactions between perturbations to a system can provide evidence\nfor the causal dependencies of the underlying underlying mechanisms of a\nsystem. When observations are low dimensional, hand crafted measurements,\ndetecting interactions amounts to simple statistical tests, but it is not\nobvious how to detect interactions between perturbations affecting latent\nvariables. We derive two interaction tests that are based on pairwise\ninterventions, and show how these tests can be integrated into an active\nlearning pipeline to efficiently discover pairwise interactions between\nperturbations. We illustrate the value of these tests in the context of\nbiology, where pairwise perturbation experiments are frequently used to reveal\ninteractions that are not observable from any single perturbation. Our tests\ncan be run on unstructured data, such as the pixels in an image, which enables\na more general notion of interaction than typical cell viability experiments,\nand can be run on cheaper experimental assays. We validate on several synthetic\nand real biological experiments that our tests are able to identify interacting\npairs effectively. We evaluate our approach on a real biological experiment\nwhere we knocked out 50 pairs of genes and measured the effect with microscopy\nimages. We show that we are able to recover significantly more known biological\ninteractions than random search and standard active learning baselines.","PeriodicalId":501340,"journal":{"name":"arXiv - STAT - Machine Learning","volume":"45 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.07594","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Pairwise interactions between perturbations to a system can provide evidence
for the causal dependencies of the underlying underlying mechanisms of a
system. When observations are low dimensional, hand crafted measurements,
detecting interactions amounts to simple statistical tests, but it is not
obvious how to detect interactions between perturbations affecting latent
variables. We derive two interaction tests that are based on pairwise
interventions, and show how these tests can be integrated into an active
learning pipeline to efficiently discover pairwise interactions between
perturbations. We illustrate the value of these tests in the context of
biology, where pairwise perturbation experiments are frequently used to reveal
interactions that are not observable from any single perturbation. Our tests
can be run on unstructured data, such as the pixels in an image, which enables
a more general notion of interaction than typical cell viability experiments,
and can be run on cheaper experimental assays. We validate on several synthetic
and real biological experiments that our tests are able to identify interacting
pairs effectively. We evaluate our approach on a real biological experiment
where we knocked out 50 pairs of genes and measured the effect with microscopy
images. We show that we are able to recover significantly more known biological
interactions than random search and standard active learning baselines.