{"title":"有噪声的杠杆矩阵完成。","authors":"Xinjian Huang;Weiwei Liu;Bo Du;Dacheng Tao","doi":"10.1109/TCYB.2023.3305552","DOIUrl":null,"url":null,"abstract":"Completing low-rank matrices from subsampled measurements has received much attention in the past decade. Existing works indicate that \n<inline-formula> <tex-math>$\\mathcal {O}(nr\\log ^{2}(n))$ </tex-math></inline-formula>\n datums are required to theoretically secure the completion of an \n<inline-formula> <tex-math>$n \\times n$ </tex-math></inline-formula>\n noisy matrix of rank \n<inline-formula> <tex-math>$r$ </tex-math></inline-formula>\n with high probability, under some quite restrictive assumptions: 1) the underlying matrix must be incoherent and 2) observations follow the uniform distribution. The restrictiveness is partially due to ignoring the roles of the leverage score and the oracle information of each element. In this article, we employ the leverage scores to characterize the importance of each element and significantly relax assumptions to: 1) not any other structure assumptions are imposed on the underlying low-rank matrix and 2) elements being observed are appropriately dependent on their importance via the leverage score. Under these assumptions, instead of uniform sampling, we devise an ununiform/biased sampling procedure that can reveal the “importance” of each observed element. Our proofs are supported by a novel approach that phrases sufficient optimality conditions based on the Golfing scheme, which would be of independent interest to the wider areas. Theoretical findings show that we can provably recover an unknown \n<inline-formula> <tex-math>$n\\times n$ </tex-math></inline-formula>\n matrix of rank \n<inline-formula> <tex-math>$r$ </tex-math></inline-formula>\n from just about \n<inline-formula> <tex-math>$\\mathcal {O}(nr\\log ^{2} (n))$ </tex-math></inline-formula>\n entries, even when the observed entries are corrupted with a small amount of noisy information. The empirical results align precisely with our theories.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"54 8","pages":"4443-4453"},"PeriodicalIF":9.4000,"publicationDate":"2023-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Leveraged Matrix Completion With Noise\",\"authors\":\"Xinjian Huang;Weiwei Liu;Bo Du;Dacheng Tao\",\"doi\":\"10.1109/TCYB.2023.3305552\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Completing low-rank matrices from subsampled measurements has received much attention in the past decade. Existing works indicate that \\n<inline-formula> <tex-math>$\\\\mathcal {O}(nr\\\\log ^{2}(n))$ </tex-math></inline-formula>\\n datums are required to theoretically secure the completion of an \\n<inline-formula> <tex-math>$n \\\\times n$ </tex-math></inline-formula>\\n noisy matrix of rank \\n<inline-formula> <tex-math>$r$ </tex-math></inline-formula>\\n with high probability, under some quite restrictive assumptions: 1) the underlying matrix must be incoherent and 2) observations follow the uniform distribution. The restrictiveness is partially due to ignoring the roles of the leverage score and the oracle information of each element. In this article, we employ the leverage scores to characterize the importance of each element and significantly relax assumptions to: 1) not any other structure assumptions are imposed on the underlying low-rank matrix and 2) elements being observed are appropriately dependent on their importance via the leverage score. Under these assumptions, instead of uniform sampling, we devise an ununiform/biased sampling procedure that can reveal the “importance” of each observed element. Our proofs are supported by a novel approach that phrases sufficient optimality conditions based on the Golfing scheme, which would be of independent interest to the wider areas. Theoretical findings show that we can provably recover an unknown \\n<inline-formula> <tex-math>$n\\\\times n$ </tex-math></inline-formula>\\n matrix of rank \\n<inline-formula> <tex-math>$r$ </tex-math></inline-formula>\\n from just about \\n<inline-formula> <tex-math>$\\\\mathcal {O}(nr\\\\log ^{2} (n))$ </tex-math></inline-formula>\\n entries, even when the observed entries are corrupted with a small amount of noisy information. The empirical results align precisely with our theories.\",\"PeriodicalId\":13112,\"journal\":{\"name\":\"IEEE Transactions on Cybernetics\",\"volume\":\"54 8\",\"pages\":\"4443-4453\"},\"PeriodicalIF\":9.4000,\"publicationDate\":\"2023-09-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Cybernetics\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10251395/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Cybernetics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10251395/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
摘要
在过去十年中,通过子采样测量完成低秩矩阵的研究受到了广泛关注。现有研究表明,在一些限制性假设条件下,理论上需要 O(nrlog2(n))个基准点才能高概率地完成一个 n ×n 的秩为 r 的噪声矩阵:1)底层矩阵必须是不连贯的;2)观测结果遵循均匀分布。限制性的部分原因是忽略了每个元素的杠杆分数和甲骨文信息的作用。在本文中,我们利用杠杆分数来描述每个元素的重要性,并大大放宽了以下假设:1) 不对底层低阶矩阵施加任何其他结构假设;2) 被观测元素通过杠杆分数适当地依赖于其重要性。在这些假设条件下,我们设计了一种能揭示每个被观测元素 "重要性 "的非均匀/有偏取样程序,而不是均匀取样。我们的证明得到了一种新方法的支持,这种新方法基于 Golfing 方案给出了充分的最优条件,这对更广泛的领域具有独立的意义。理论研究结果表明,即使观察到的条目受到少量噪声信息的干扰,我们也能从大约 O(nrlog2 (n)) 个条目中恢复出一个秩为 r 的未知 n×n 矩阵。实证结果与我们的理论完全一致。
Completing low-rank matrices from subsampled measurements has received much attention in the past decade. Existing works indicate that
$\mathcal {O}(nr\log ^{2}(n))$
datums are required to theoretically secure the completion of an
$n \times n$
noisy matrix of rank
$r$
with high probability, under some quite restrictive assumptions: 1) the underlying matrix must be incoherent and 2) observations follow the uniform distribution. The restrictiveness is partially due to ignoring the roles of the leverage score and the oracle information of each element. In this article, we employ the leverage scores to characterize the importance of each element and significantly relax assumptions to: 1) not any other structure assumptions are imposed on the underlying low-rank matrix and 2) elements being observed are appropriately dependent on their importance via the leverage score. Under these assumptions, instead of uniform sampling, we devise an ununiform/biased sampling procedure that can reveal the “importance” of each observed element. Our proofs are supported by a novel approach that phrases sufficient optimality conditions based on the Golfing scheme, which would be of independent interest to the wider areas. Theoretical findings show that we can provably recover an unknown
$n\times n$
matrix of rank
$r$
from just about
$\mathcal {O}(nr\log ^{2} (n))$
entries, even when the observed entries are corrupted with a small amount of noisy information. The empirical results align precisely with our theories.
期刊介绍:
The scope of the IEEE Transactions on Cybernetics includes computational approaches to the field of cybernetics. Specifically, the transactions welcomes papers on communication and control across machines or machine, human, and organizations. The scope includes such areas as computational intelligence, computer vision, neural networks, genetic algorithms, machine learning, fuzzy systems, cognitive systems, decision making, and robotics, to the extent that they contribute to the theme of cybernetics or demonstrate an application of cybernetics principles.