{"title":"稀疏随机矩阵的谱大偏差","authors":"Shirshendu Ganguly, Ella Hiesmayr, Kyeongsik Nam","doi":"10.1112/jlms.12954","DOIUrl":null,"url":null,"abstract":"<p>Eigenvalues of Wigner matrices has been a major topic of investigation. A particularly important subclass of such random matrices, useful in many applications, are what are known as sparse or diluted random matrices, where each entry in a Wigner matrix is multiplied by an independent Bernoulli random variable with mean <span></span><math>\n <semantics>\n <mi>p</mi>\n <annotation>$p$</annotation>\n </semantics></math>. Alternatively, such a matrix can be viewed as the adjacency matrix of an Erdős–Rényi graph <span></span><math>\n <semantics>\n <msub>\n <mi>G</mi>\n <mrow>\n <mi>n</mi>\n <mo>,</mo>\n <mi>p</mi>\n </mrow>\n </msub>\n <annotation>$\\mathcal {G}_{n,p}$</annotation>\n </semantics></math> equipped with independent and identically distributed (i.i.d.) edge-weights. An observable of particular interest is the largest eigenvalue. In this paper, we study the large deviations behavior of the largest eigenvalue of such matrices, a topic that has received considerable attention over the years. While certain techniques have been devised for the case when <span></span><math>\n <semantics>\n <mi>p</mi>\n <annotation>$p$</annotation>\n </semantics></math> is fixed or perhaps going to zero not too fast with the matrix size, we focus on the case <span></span><math>\n <semantics>\n <mrow>\n <mi>p</mi>\n <mo>=</mo>\n <mfrac>\n <mi>d</mi>\n <mi>n</mi>\n </mfrac>\n </mrow>\n <annotation>$p = \\frac{d}{n}$</annotation>\n </semantics></math>, that is, constant average degree regime of sparsity, which is a central example due to its connections to many models in statistical mechanics and other applications. Most known techniques break down in this regime and even the typical behavior of the spectrum of such random matrices is not very well understood. So far, results were known only for the Erdős–Rényi graph <span></span><math>\n <semantics>\n <msub>\n <mi>G</mi>\n <mrow>\n <mi>n</mi>\n <mo>,</mo>\n <mfrac>\n <mi>d</mi>\n <mi>n</mi>\n </mfrac>\n </mrow>\n </msub>\n <annotation>$\\mathcal {G}_{n,\\frac{d}{n}}$</annotation>\n </semantics></math> <i>without</i> edge-weights and with <i>Gaussian</i> edge-weights. In the present article, we consider the effect of general weight distributions. More specifically, we consider entry distributions whose tail probabilities decay at rate <span></span><math>\n <semantics>\n <msup>\n <mi>e</mi>\n <mrow>\n <mo>−</mo>\n <msup>\n <mi>t</mi>\n <mi>α</mi>\n </msup>\n </mrow>\n </msup>\n <annotation>$e^{-t^\\alpha }$</annotation>\n </semantics></math> with <span></span><math>\n <semantics>\n <mrow>\n <mi>α</mi>\n <mo>></mo>\n <mn>0</mn>\n </mrow>\n <annotation>$\\alpha &gt;0$</annotation>\n </semantics></math>, where the regimes <span></span><math>\n <semantics>\n <mrow>\n <mn>0</mn>\n <mo><</mo>\n <mi>α</mi>\n <mo><</mo>\n <mn>2</mn>\n </mrow>\n <annotation>$0&lt;\\alpha &lt; 2$</annotation>\n </semantics></math> and <span></span><math>\n <semantics>\n <mrow>\n <mi>α</mi>\n <mo>></mo>\n <mn>2</mn>\n </mrow>\n <annotation>$\\alpha &gt; 2$</annotation>\n </semantics></math> correspond to tails heavier and lighter than the Gaussian tail, respectively. While in many natural settings the large deviations behavior is expected to depend crucially on the entry distribution, we establish a surprising and rare universal behavior showing that this is not the case when <span></span><math>\n <semantics>\n <mrow>\n <mi>α</mi>\n <mo>></mo>\n <mn>2</mn>\n </mrow>\n <annotation>$\\alpha &gt; 2$</annotation>\n </semantics></math>. In contrast, in the <span></span><math>\n <semantics>\n <mrow>\n <mi>α</mi>\n <mo><</mo>\n <mn>2</mn>\n </mrow>\n <annotation>$\\alpha &lt; 2$</annotation>\n </semantics></math> case, the large deviation rate function is no longer universal and is given by the solution to a variational problem, the description of which involves a generalization of the Motzkin–Straus theorem, a classical result from spectral graph theory. As a byproduct of our large deviation results, we also establish the law of large numbers behavior for the largest eigenvalue, which also seems to be new and difficult to obtain using existing methods. In particular, we show that the typical value of the largest eigenvalue exhibits a phase transition at <span></span><math>\n <semantics>\n <mrow>\n <mi>α</mi>\n <mo>=</mo>\n <mn>2</mn>\n </mrow>\n <annotation>$\\alpha = 2$</annotation>\n </semantics></math>, that is, corresponding to the Gaussian distribution.</p>","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":null,"pages":null},"PeriodicalIF":16.4000,"publicationDate":"2024-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Spectral large deviations of sparse random matrices\",\"authors\":\"Shirshendu Ganguly, Ella Hiesmayr, Kyeongsik Nam\",\"doi\":\"10.1112/jlms.12954\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Eigenvalues of Wigner matrices has been a major topic of investigation. A particularly important subclass of such random matrices, useful in many applications, are what are known as sparse or diluted random matrices, where each entry in a Wigner matrix is multiplied by an independent Bernoulli random variable with mean <span></span><math>\\n <semantics>\\n <mi>p</mi>\\n <annotation>$p$</annotation>\\n </semantics></math>. Alternatively, such a matrix can be viewed as the adjacency matrix of an Erdős–Rényi graph <span></span><math>\\n <semantics>\\n <msub>\\n <mi>G</mi>\\n <mrow>\\n <mi>n</mi>\\n <mo>,</mo>\\n <mi>p</mi>\\n </mrow>\\n </msub>\\n <annotation>$\\\\mathcal {G}_{n,p}$</annotation>\\n </semantics></math> equipped with independent and identically distributed (i.i.d.) edge-weights. An observable of particular interest is the largest eigenvalue. In this paper, we study the large deviations behavior of the largest eigenvalue of such matrices, a topic that has received considerable attention over the years. While certain techniques have been devised for the case when <span></span><math>\\n <semantics>\\n <mi>p</mi>\\n <annotation>$p$</annotation>\\n </semantics></math> is fixed or perhaps going to zero not too fast with the matrix size, we focus on the case <span></span><math>\\n <semantics>\\n <mrow>\\n <mi>p</mi>\\n <mo>=</mo>\\n <mfrac>\\n <mi>d</mi>\\n <mi>n</mi>\\n </mfrac>\\n </mrow>\\n <annotation>$p = \\\\frac{d}{n}$</annotation>\\n </semantics></math>, that is, constant average degree regime of sparsity, which is a central example due to its connections to many models in statistical mechanics and other applications. Most known techniques break down in this regime and even the typical behavior of the spectrum of such random matrices is not very well understood. So far, results were known only for the Erdős–Rényi graph <span></span><math>\\n <semantics>\\n <msub>\\n <mi>G</mi>\\n <mrow>\\n <mi>n</mi>\\n <mo>,</mo>\\n <mfrac>\\n <mi>d</mi>\\n <mi>n</mi>\\n </mfrac>\\n </mrow>\\n </msub>\\n <annotation>$\\\\mathcal {G}_{n,\\\\frac{d}{n}}$</annotation>\\n </semantics></math> <i>without</i> edge-weights and with <i>Gaussian</i> edge-weights. In the present article, we consider the effect of general weight distributions. More specifically, we consider entry distributions whose tail probabilities decay at rate <span></span><math>\\n <semantics>\\n <msup>\\n <mi>e</mi>\\n <mrow>\\n <mo>−</mo>\\n <msup>\\n <mi>t</mi>\\n <mi>α</mi>\\n </msup>\\n </mrow>\\n </msup>\\n <annotation>$e^{-t^\\\\alpha }$</annotation>\\n </semantics></math> with <span></span><math>\\n <semantics>\\n <mrow>\\n <mi>α</mi>\\n <mo>></mo>\\n <mn>0</mn>\\n </mrow>\\n <annotation>$\\\\alpha &gt;0$</annotation>\\n </semantics></math>, where the regimes <span></span><math>\\n <semantics>\\n <mrow>\\n <mn>0</mn>\\n <mo><</mo>\\n <mi>α</mi>\\n <mo><</mo>\\n <mn>2</mn>\\n </mrow>\\n <annotation>$0&lt;\\\\alpha &lt; 2$</annotation>\\n </semantics></math> and <span></span><math>\\n <semantics>\\n <mrow>\\n <mi>α</mi>\\n <mo>></mo>\\n <mn>2</mn>\\n </mrow>\\n <annotation>$\\\\alpha &gt; 2$</annotation>\\n </semantics></math> correspond to tails heavier and lighter than the Gaussian tail, respectively. While in many natural settings the large deviations behavior is expected to depend crucially on the entry distribution, we establish a surprising and rare universal behavior showing that this is not the case when <span></span><math>\\n <semantics>\\n <mrow>\\n <mi>α</mi>\\n <mo>></mo>\\n <mn>2</mn>\\n </mrow>\\n <annotation>$\\\\alpha &gt; 2$</annotation>\\n </semantics></math>. In contrast, in the <span></span><math>\\n <semantics>\\n <mrow>\\n <mi>α</mi>\\n <mo><</mo>\\n <mn>2</mn>\\n </mrow>\\n <annotation>$\\\\alpha &lt; 2$</annotation>\\n </semantics></math> case, the large deviation rate function is no longer universal and is given by the solution to a variational problem, the description of which involves a generalization of the Motzkin–Straus theorem, a classical result from spectral graph theory. As a byproduct of our large deviation results, we also establish the law of large numbers behavior for the largest eigenvalue, which also seems to be new and difficult to obtain using existing methods. In particular, we show that the typical value of the largest eigenvalue exhibits a phase transition at <span></span><math>\\n <semantics>\\n <mrow>\\n <mi>α</mi>\\n <mo>=</mo>\\n <mn>2</mn>\\n </mrow>\\n <annotation>$\\\\alpha = 2$</annotation>\\n </semantics></math>, that is, corresponding to the Gaussian distribution.</p>\",\"PeriodicalId\":1,\"journal\":{\"name\":\"Accounts of Chemical Research\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":16.4000,\"publicationDate\":\"2024-06-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Accounts of Chemical Research\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1112/jlms.12954\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"100","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1112/jlms.12954","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
摘要
维格纳矩阵的特征值一直是研究的主要课题。这类随机矩阵的一个特别重要的子类是所谓的稀疏或稀释随机矩阵,其中维格纳矩阵的每个条目都乘以一个均值为 p $p$ 的独立伯努利随机变量,在许多应用中都非常有用。或者,这样的矩阵可以被看作是厄尔多斯-雷尼图 G n , p $mathcal {G}_{n,p}$ 的邻接矩阵,它配备了独立且同分布(i.i.d.)的边权重。最大特征值是一个特别值得关注的观测值。在本文中,我们将研究此类矩阵最大特征值的大偏差行为,这是多年来备受关注的一个课题。虽然我们已经针对 p $p$ 固定或可能随矩阵大小快速归零的情况设计了某些技术,但我们重点研究 p = d n $p = \frac{d}{n}$ 的情况,即平均度恒定的稀疏性机制,由于它与统计力学和其他应用中的许多模型有关,因此是一个核心例子。大多数已知技术都会在这一机制中崩溃,甚至连此类随机矩阵频谱的典型行为都不甚了解。迄今为止,我们只知道 Erdős-Rényi 图 G n , d n $mathcal {G}_{n,\frac{d}{n}$ 没有边权重和有高斯边权重的结果。在本文中,我们将考虑一般权重分布的影响。更具体地说,我们考虑的条目分布的尾部概率衰减率为 e - t α $e^{-t^\alpha }$,α > 0 $\alpha >0$ ,其中 0 < α < 2 $0<\alpha < 2$ 和 α > 2 $\alpha > 2$ 分别对应于比高斯尾部更重和更轻的尾部。虽然在许多自然环境中,大偏差行为预计会在很大程度上取决于入口分布,但我们建立了一个令人惊讶且罕见的普遍行为,表明当 α > 2 $\alpha > 2$ 时情况并非如此。相反,在 α < 2 $\alpha < 2$ 的情况下,大偏差率函数不再具有普遍性,而是由一个变异问题的解给出的,对该问题的描述涉及莫茨金-斯特劳斯定理(Motzkin-Straus theorem)的一般化,这是光谱图理论的一个经典结果。作为大偏差结果的副产品,我们还建立了最大特征值的大数定律行为,这似乎也是一个新问题,而且难以用现有方法获得。特别是,我们证明了最大特征值的典型值在 α = 2 $\alpha = 2$ 时出现相变,即对应于高斯分布。
Spectral large deviations of sparse random matrices
Eigenvalues of Wigner matrices has been a major topic of investigation. A particularly important subclass of such random matrices, useful in many applications, are what are known as sparse or diluted random matrices, where each entry in a Wigner matrix is multiplied by an independent Bernoulli random variable with mean . Alternatively, such a matrix can be viewed as the adjacency matrix of an Erdős–Rényi graph equipped with independent and identically distributed (i.i.d.) edge-weights. An observable of particular interest is the largest eigenvalue. In this paper, we study the large deviations behavior of the largest eigenvalue of such matrices, a topic that has received considerable attention over the years. While certain techniques have been devised for the case when is fixed or perhaps going to zero not too fast with the matrix size, we focus on the case , that is, constant average degree regime of sparsity, which is a central example due to its connections to many models in statistical mechanics and other applications. Most known techniques break down in this regime and even the typical behavior of the spectrum of such random matrices is not very well understood. So far, results were known only for the Erdős–Rényi graph without edge-weights and with Gaussian edge-weights. In the present article, we consider the effect of general weight distributions. More specifically, we consider entry distributions whose tail probabilities decay at rate with , where the regimes and correspond to tails heavier and lighter than the Gaussian tail, respectively. While in many natural settings the large deviations behavior is expected to depend crucially on the entry distribution, we establish a surprising and rare universal behavior showing that this is not the case when . In contrast, in the case, the large deviation rate function is no longer universal and is given by the solution to a variational problem, the description of which involves a generalization of the Motzkin–Straus theorem, a classical result from spectral graph theory. As a byproduct of our large deviation results, we also establish the law of large numbers behavior for the largest eigenvalue, which also seems to be new and difficult to obtain using existing methods. In particular, we show that the typical value of the largest eigenvalue exhibits a phase transition at , that is, corresponding to the Gaussian distribution.
期刊介绍:
Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance.
Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.