{"title":"相关随机向量的检测","authors":"Dor Elimelech;Wasim Huleihel","doi":"10.1109/TIT.2024.3435008","DOIUrl":null,"url":null,"abstract":"In this paper, we investigate the problem of deciding whether two standard normal random vectors \n<inline-formula> <tex-math>$\\textsf {X}\\in \\mathbb {R}^{n}$ </tex-math></inline-formula>\n and \n<inline-formula> <tex-math>$\\textsf {Y}\\in \\mathbb {R}^{n}$ </tex-math></inline-formula>\n are correlated or not. This is formulated as a hypothesis testing problem, where under the null hypothesis, these vectors are statistically independent, while under the alternative, \n<inline-formula> <tex-math>$\\textsf {X}$ </tex-math></inline-formula>\n and a randomly and uniformly permuted version of \n<inline-formula> <tex-math>$\\textsf {Y}$ </tex-math></inline-formula>\n, are correlated with correlation \n<inline-formula> <tex-math>$\\rho $ </tex-math></inline-formula>\n. We analyze the thresholds at which optimal testing is information-theoretically impossible and possible, as a function of n and \n<inline-formula> <tex-math>$\\rho $ </tex-math></inline-formula>\n. To derive our information-theoretic lower bounds, we develop a novel technique for evaluating the second moment of the likelihood ratio using an orthogonal polynomials expansion, which among other things, reveals a surprising connection to integer partition functions. We also study a multi-dimensional generalization of the above setting, where rather than two vectors we observe two databases/matrices, and furthermore allow for partial correlations between these two.","PeriodicalId":13494,"journal":{"name":"IEEE Transactions on Information Theory","volume":"70 12","pages":"8942-8960"},"PeriodicalIF":2.2000,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Detection of Correlated Random Vectors\",\"authors\":\"Dor Elimelech;Wasim Huleihel\",\"doi\":\"10.1109/TIT.2024.3435008\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we investigate the problem of deciding whether two standard normal random vectors \\n<inline-formula> <tex-math>$\\\\textsf {X}\\\\in \\\\mathbb {R}^{n}$ </tex-math></inline-formula>\\n and \\n<inline-formula> <tex-math>$\\\\textsf {Y}\\\\in \\\\mathbb {R}^{n}$ </tex-math></inline-formula>\\n are correlated or not. This is formulated as a hypothesis testing problem, where under the null hypothesis, these vectors are statistically independent, while under the alternative, \\n<inline-formula> <tex-math>$\\\\textsf {X}$ </tex-math></inline-formula>\\n and a randomly and uniformly permuted version of \\n<inline-formula> <tex-math>$\\\\textsf {Y}$ </tex-math></inline-formula>\\n, are correlated with correlation \\n<inline-formula> <tex-math>$\\\\rho $ </tex-math></inline-formula>\\n. We analyze the thresholds at which optimal testing is information-theoretically impossible and possible, as a function of n and \\n<inline-formula> <tex-math>$\\\\rho $ </tex-math></inline-formula>\\n. To derive our information-theoretic lower bounds, we develop a novel technique for evaluating the second moment of the likelihood ratio using an orthogonal polynomials expansion, which among other things, reveals a surprising connection to integer partition functions. We also study a multi-dimensional generalization of the above setting, where rather than two vectors we observe two databases/matrices, and furthermore allow for partial correlations between these two.\",\"PeriodicalId\":13494,\"journal\":{\"name\":\"IEEE Transactions on Information Theory\",\"volume\":\"70 12\",\"pages\":\"8942-8960\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2024-08-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Information Theory\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10623497/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Information Theory","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10623497/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
In this paper, we investigate the problem of deciding whether two standard normal random vectors
$\textsf {X}\in \mathbb {R}^{n}$
and
$\textsf {Y}\in \mathbb {R}^{n}$
are correlated or not. This is formulated as a hypothesis testing problem, where under the null hypothesis, these vectors are statistically independent, while under the alternative,
$\textsf {X}$
and a randomly and uniformly permuted version of
$\textsf {Y}$
, are correlated with correlation
$\rho $
. We analyze the thresholds at which optimal testing is information-theoretically impossible and possible, as a function of n and
$\rho $
. To derive our information-theoretic lower bounds, we develop a novel technique for evaluating the second moment of the likelihood ratio using an orthogonal polynomials expansion, which among other things, reveals a surprising connection to integer partition functions. We also study a multi-dimensional generalization of the above setting, where rather than two vectors we observe two databases/matrices, and furthermore allow for partial correlations between these two.
期刊介绍:
The IEEE Transactions on Information Theory is a journal that publishes theoretical and experimental papers concerned with the transmission, processing, and utilization of information. The boundaries of acceptable subject matter are intentionally not sharply delimited. Rather, it is hoped that as the focus of research activity changes, a flexible policy will permit this Transactions to follow suit. Current appropriate topics are best reflected by recent Tables of Contents; they are summarized in the titles of editorial areas that appear on the inside front cover.