SIAM Journal on Matrix Analysis and Applications最新文献

英文中文

Efficient Vectors for Block Perturbed Consistent Matrices 块扰动一致矩阵的高效向量

IF 1.5 2区数学 Q2 MATHEMATICS, APPLIED

SIAM Journal on Matrix Analysis and Applications

Pub Date : 2024-02-08 DOI: 10.1137/23m1580310

Susana Furtado, Charles Johnson

SIAM Journal on Matrix Analysis and Applications, Volume 45, Issue 1, Page 601-618, March 2024.
Abstract. In prioritization schemes, based on pairwise comparisons, such as the analytical hierarchy process, it is important to extract a cardinal ranking vector from a reciprocal matrix that is unlikely to be consistent. It is natural to choose such a vector only from efficient ones. Recently, a method to generate inductively all efficient vectors for any reciprocal matrix has been discovered. Here we focus on the study of efficient vectors for a reciprocal matrix that is a block perturbation of a consistent matrix in the sense that it is obtained from a consistent matrix by modifying entries only in a proper principal submatrix. We determine an explicit class of efficient vectors for such matrices. Based on this, we give a description of all the efficient vectors in the 3-by-3 block perturbed case. In addition, we give sufficient conditions for the right Perron eigenvector of such matrices to be efficient and provide examples in which efficiency does not occur. Also, we consider a certain type of constant block perturbed consistent matrices, for which we may construct a class of efficient vectors, and demonstrate the efficiency of the Perron eigenvector. Appropriate examples are provided throughout.

SIAM 矩阵分析与应用期刊》，第 45 卷，第 1 期，第 601-618 页，2024 年 3 月。摘要在基于成对比较的优先级排序方案（如分析层次过程）中，从倒易矩阵中提取一个不可能一致的心排序向量非常重要。从有效的向量中选择这样一个向量是很自然的。最近，人们发现了一种方法，可以归纳生成任何倒易矩阵的所有有效向量。在这里，我们重点研究倒易矩阵的有效向量，倒易矩阵是一致矩阵的块扰动，即它是由一致矩阵通过只修改适当的主子矩阵中的条目得到的。我们为这类矩阵确定了一类明确的有效向量。在此基础上，我们给出了 3 乘 3 块扰动情况下所有高效向量的描述。此外，我们还给出了此类矩阵的右佩伦特征向量有效的充分条件，并举例说明了不存在有效特征向量的情况。此外，我们还考虑了某类恒定块扰动一致矩阵，对于这类矩阵，我们可以构建一类高效向量，并证明 Perron 特征向量的高效性。我们还提供了适当的例子。

{"title":"Efficient Vectors for Block Perturbed Consistent Matrices","authors":"Susana Furtado, Charles Johnson","doi":"10.1137/23m1580310","DOIUrl":"https://doi.org/10.1137/23m1580310","url":null,"abstract":"SIAM Journal on Matrix Analysis and Applications, Volume 45, Issue 1, Page 601-618, March 2024. Abstract. In prioritization schemes, based on pairwise comparisons, such as the analytical hierarchy process, it is important to extract a cardinal ranking vector from a reciprocal matrix that is unlikely to be consistent. It is natural to choose such a vector only from efficient ones. Recently, a method to generate inductively all efficient vectors for any reciprocal matrix has been discovered. Here we focus on the study of efficient vectors for a reciprocal matrix that is a block perturbation of a consistent matrix in the sense that it is obtained from a consistent matrix by modifying entries only in a proper principal submatrix. We determine an explicit class of efficient vectors for such matrices. Based on this, we give a description of all the efficient vectors in the 3-by-3 block perturbed case. In addition, we give sufficient conditions for the right Perron eigenvector of such matrices to be efficient and provide examples in which efficiency does not occur. Also, we consider a certain type of constant block perturbed consistent matrices, for which we may construct a class of efficient vectors, and demonstrate the efficiency of the Perron eigenvector. Appropriate examples are provided throughout.","PeriodicalId":49538,"journal":{"name":"SIAM Journal on Matrix Analysis and Applications","volume":"595 ","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139756194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Perturbation and Inverse Problems of Stochastic Matrices 随机矩阵的扰动和逆问题

IF 1.5 2区数学 Q2 MATHEMATICS, APPLIED

SIAM Journal on Matrix Analysis and Applications

Pub Date : 2024-02-08 DOI: 10.1137/22m1489162

Joost Berkhout, Bernd Heidergott, Paul Van Dooren

SIAM Journal on Matrix Analysis and Applications, Volume 45, Issue 1, Page 553-584, March 2024.
Abstract. It is a classical task in perturbation analysis to find norm bounds on the effect of a perturbation [math] of a stochastic matrix [math] to its stationary distribution, i.e., to the unique normalized left Perron eigenvector. A common assumption is to consider [math] to be given and to find bounds on its impact, but in this paper, we rather focus on an inverse optimization problem called the target stationary distribution problem (TSDP). The starting point is a target stationary distribution, and we search for a perturbation [math] of the minimum norm such that [math] remains stochastic and has the desired target stationary distribution. It is shown that TSDP has relevant applications in the design of, for example, road networks, social networks, hyperlink networks, and queuing systems. The key to our approach is that we work with rank-1 perturbations. Building on those results for rank-1 perturbations, we provide heuristics for the TSDP that construct arbitrary rank perturbations as sums of appropriately constructed rank-1 perturbations.

SIAM 矩阵分析与应用期刊》，第 45 卷，第 1 期，第 553-584 页，2024 年 3 月。摘要。随机矩阵[math]的扰动[math]对其静态分布的影响，即对唯一归一化左佩伦特征向量的影响，是扰动分析中的一项经典任务。一个常见的假设是将[math]视为给定的，并找出其影响的边界，但在本文中，我们更关注一个反向优化问题，即目标静态分布问题（TSDP）。起点是一个目标静态分布，我们寻找一个最小规范的扰动[math]，使[math]保持随机，并具有所需的目标静态分布。研究表明，TSDP 可应用于道路网络、社交网络、超链接网络和排队系统等的设计。我们方法的关键在于我们使用的是秩-1扰动。基于这些针对秩-1扰动的结果，我们为 TSDP 提供了启发式方法，将任意秩扰动构造为适当构造的秩-1 扰动之和。

{"title":"Perturbation and Inverse Problems of Stochastic Matrices","authors":"Joost Berkhout, Bernd Heidergott, Paul Van Dooren","doi":"10.1137/22m1489162","DOIUrl":"https://doi.org/10.1137/22m1489162","url":null,"abstract":"SIAM Journal on Matrix Analysis and Applications, Volume 45, Issue 1, Page 553-584, March 2024. Abstract. It is a classical task in perturbation analysis to find norm bounds on the effect of a perturbation [math] of a stochastic matrix [math] to its stationary distribution, i.e., to the unique normalized left Perron eigenvector. A common assumption is to consider [math] to be given and to find bounds on its impact, but in this paper, we rather focus on an inverse optimization problem called the target stationary distribution problem (TSDP). The starting point is a target stationary distribution, and we search for a perturbation [math] of the minimum norm such that [math] remains stochastic and has the desired target stationary distribution. It is shown that TSDP has relevant applications in the design of, for example, road networks, social networks, hyperlink networks, and queuing systems. The key to our approach is that we work with rank-1 perturbations. Building on those results for rank-1 perturbations, we provide heuristics for the TSDP that construct arbitrary rank perturbations as sums of appropriately constructed rank-1 perturbations.","PeriodicalId":49538,"journal":{"name":"SIAM Journal on Matrix Analysis and Applications","volume":"25 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139756461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Spectrum Maximizing Products Are Not Generically Unique 频谱最大化产品并非一般独一无二

IF 1.5 2区数学 Q2 MATHEMATICS, APPLIED

SIAM Journal on Matrix Analysis and Applications

Pub Date : 2024-02-08 DOI: 10.1137/23m1550621

Jairo Bochi, Piotr Laskawiec

SIAM Journal on Matrix Analysis and Applications, Volume 45, Issue 1, Page 585-600, March 2024.
Abstract. It is widely believed that typical finite families of [math] matrices admit finite products that attain the joint spectral radius. This conjecture is supported by computational experiments and it naturally leads to the following question: are these spectrum maximizing products typically unique, up to cyclic permutations and powers? We answer this question negatively. As discovered by Horowitz around fifty years ago, there are products of matrices that always have the same spectral radius despite not being cyclic permutations of one another. We show that the simplest Horowitz products can be spectrum maximizing in a robust way; more precisely, we exhibit a small but nonempty open subset of pairs of [math] matrices [math] for which the products [math] and [math] are both spectrum maximizing.

SIAM 矩阵分析与应用期刊》，第 45 卷，第 1 期，第 585-600 页，2024 年 3 月。摘要。人们普遍认为，[数学]矩阵的典型有限族允许达到联合谱半径的有限乘积。这一猜想得到了计算实验的支持，并自然而然地引出了下面的问题：这些频谱最大化乘积是否通常是唯一的，直至循环排列和幂级数？我们的回答是否定的。正如霍洛维茨（Horowitz）在五十年前发现的那样，有一些矩阵的乘积尽管不是彼此的循环排列，却总是具有相同的频谱半径。我们证明，最简单的霍洛维茨乘积也能以稳健的方式实现频谱最大化；更确切地说，我们展示了[math]矩阵[math]对的一个小而非空的开放子集，对于这个子集，[math]和[math]乘积都能实现频谱最大化。

引用次数: 0

Five-Precision GMRES-Based Iterative Refinement 基于五精度 GMRES 的迭代精炼

IF 1.5 2区数学 Q2 MATHEMATICS, APPLIED

SIAM Journal on Matrix Analysis and Applications

Pub Date : 2024-02-08 DOI: 10.1137/23m1549079

Patrick Amestoy, Alfredo Buttari, Nicholas J. Higham, Jean-Yves L’Excellent, Theo Mary, Bastien Vieublé

SIAM Journal on Matrix Analysis and Applications, Volume 45, Issue 1, Page 529-552, March 2024.
Abstract. GMRES-based iterative refinement in three precisions (GMRES-IR3), proposed by Carson and Higham in 2018, uses a low precision LU factorization to accelerate the solution of a linear system without compromising numerical stability or robustness. GMRES-IR3 solves the update equation of iterative refinement using GMRES preconditioned by the LU factors, where all operations within GMRES are carried out in the working precision [math], except for the matrix–vector products and the application of the preconditioner, which require the use of extra precision [math]. The use of extra precision can be expensive, and is especially unattractive if it is not available in hardware; for this reason, existing implementations have not used extra precision, despite the absence of an error analysis for this approach. In this article, we propose to relax the requirements on the precisions used within GMRES, allowing the use of arbitrary precisions [math] for applying the preconditioned matrix–vector product and [math] for the rest of the operations. We obtain the five-precision GMRES-based iterative refinement (GMRES-IR5) algorithm which has the potential to solve relatively badly conditioned problems in less time and memory than GMRES-IR3. We develop a rounding error analysis that generalizes that of GMRES-IR3, obtaining conditions under which the forward and backward errors converge to their limiting values. Our analysis makes use of a new result on the backward stability of MGS-GMRES in two precisions. On hardware where three or more arithmetics are available, which is becoming very common, the number of possible combinations of precisions in GMRES-IR5 is extremely large. We provide an analysis of our theoretical results that identifies a relatively small subset of relevant combinations. By choosing from within this subset one can achieve different levels of tradeoff between cost and robustness, which allows for a finer choice of precisions depending on the problem difficulty and the available hardware. We carry out numerical experiments on random dense and SuiteSparse matrices to validate our theoretical analysis and discuss the complexity of GMRES-IR5.

SIAM 矩阵分析与应用期刊》，第 45 卷，第 1 期，第 529-552 页，2024 年 3 月。摘要。基于GMRES的三精度迭代精化（GMRES-IR3）由Carson和Higham于2018年提出，利用低精度LU因式分解加速线性系统的求解，同时不影响数值稳定性和鲁棒性。GMRES-IR3 使用以 LU 因子为前提条件的 GMRES 求解迭代细化的更新方程，其中 GMRES 内的所有操作都在工作精度内进行[math]，只有矩阵向量积和前提条件器的应用需要使用额外精度[math]。使用额外精度的成本可能很高，如果硬件中没有额外精度，则尤其不划算；因此，尽管没有对这种方法进行误差分析，但现有的实现都没有使用额外精度。在本文中，我们建议放宽对 GMRES 中所用精度的要求，允许在应用预处理矩阵-矢量乘时使用任意精度 [math]，在其余操作中使用 [math]。我们得到了基于五精度 GMRES 的迭代精化（GMRES-IR5）算法，与 GMRES-IR3 相比，它有可能以更少的时间和内存解决条件相对较差的问题。我们对 GMRES-IR3 算法进行了舍入误差分析，得到了前向和后向误差收敛到极限值的条件。我们的分析利用了关于 MGS-GMRES 在两种精度下的后向稳定性的新结果。在有三个或更多算术运算的硬件上（这已变得非常普遍），GMRES-IR5 中可能的精度组合数量极大。我们对理论结果进行了分析，确定了相对较小的相关组合子集。通过在这个子集中进行选择，可以在成本和鲁棒性之间实现不同程度的权衡，从而根据问题的难度和可用的硬件，对精度进行更精细的选择。我们对随机密集矩阵和SuiteSparse矩阵进行了数值实验，以验证我们的理论分析，并讨论GMRES-IR5的复杂性。

{"title":"Five-Precision GMRES-Based Iterative Refinement","authors":"Patrick Amestoy, Alfredo Buttari, Nicholas J. Higham, Jean-Yves L’Excellent, Theo Mary, Bastien Vieublé","doi":"10.1137/23m1549079","DOIUrl":"https://doi.org/10.1137/23m1549079","url":null,"abstract":"SIAM Journal on Matrix Analysis and Applications, Volume 45, Issue 1, Page 529-552, March 2024. Abstract. GMRES-based iterative refinement in three precisions (GMRES-IR3), proposed by Carson and Higham in 2018, uses a low precision LU factorization to accelerate the solution of a linear system without compromising numerical stability or robustness. GMRES-IR3 solves the update equation of iterative refinement using GMRES preconditioned by the LU factors, where all operations within GMRES are carried out in the working precision [math], except for the matrix–vector products and the application of the preconditioner, which require the use of extra precision [math]. The use of extra precision can be expensive, and is especially unattractive if it is not available in hardware; for this reason, existing implementations have not used extra precision, despite the absence of an error analysis for this approach. In this article, we propose to relax the requirements on the precisions used within GMRES, allowing the use of arbitrary precisions [math] for applying the preconditioned matrix–vector product and [math] for the rest of the operations. We obtain the five-precision GMRES-based iterative refinement (GMRES-IR5) algorithm which has the potential to solve relatively badly conditioned problems in less time and memory than GMRES-IR3. We develop a rounding error analysis that generalizes that of GMRES-IR3, obtaining conditions under which the forward and backward errors converge to their limiting values. Our analysis makes use of a new result on the backward stability of MGS-GMRES in two precisions. On hardware where three or more arithmetics are available, which is becoming very common, the number of possible combinations of precisions in GMRES-IR5 is extremely large. We provide an analysis of our theoretical results that identifies a relatively small subset of relevant combinations. By choosing from within this subset one can achieve different levels of tradeoff between cost and robustness, which allows for a finer choice of precisions depending on the problem difficulty and the available hardware. We carry out numerical experiments on random dense and SuiteSparse matrices to validate our theoretical analysis and discuss the complexity of GMRES-IR5.","PeriodicalId":49538,"journal":{"name":"SIAM Journal on Matrix Analysis and Applications","volume":"33 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139756455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Unifying Framework for Higher Order Derivatives of Matrix Functions 矩阵函数高阶导数的统一框架

IF 1.5 2区数学 Q2 MATHEMATICS, APPLIED

SIAM Journal on Matrix Analysis and Applications

Pub Date : 2024-02-08 DOI: 10.1137/23m1580589

Emanuel H. Rubensson

SIAM Journal on Matrix Analysis and Applications, Volume 45, Issue 1, Page 504-528, March 2024.
Abstract. We present a theory for general partial derivatives of matrix functions of the form [math], where [math] is a matrix path of several variables ([math]). Building on results by Mathias [SIAM J. Matrix Anal. Appl., 17 (1996), pp. 610–620] for the first order derivative, we develop a block upper triangular form for higher order partial derivatives. This block form is used to derive conditions for existence and a generalized Daleckiĭ–Kreĭn formula for higher order derivatives. We show that certain specializations of this formula lead to classical formulas of quantum perturbation theory. We show how our results are related to earlier results for higher order Fréchet derivatives. Block forms of complex step approximations are introduced, and we show how those are related to evaluation of derivatives through the upper triangular form. These relations are illustrated with numerical examples.

SIAM 矩阵分析与应用期刊》，第 45 卷第 1 期，第 504-528 页，2024 年 3 月。摘要。我们提出了[math]形式矩阵函数的一般偏导数理论，其中[math]是多变量矩阵路径（[math]）。基于马蒂亚斯 [SIAM J. Matrix Anal. Appl.我们利用这种分块形式推导出高阶导数的存在条件和广义 Daleckiĭ-Kreĭn 公式。我们证明，该公式的某些特殊化会导致量子扰动理论的经典公式。我们展示了我们的结果与早先关于高阶弗雷谢特导数的结果之间的关系。我们介绍了复步近似的块形式，并说明了这些块形式与通过上三角形式求导的关系。我们将用数值示例来说明这些关系。

引用次数: 0

Row or Column Completion of Polynomial Matrices of Given Degree 给定度数多项式矩阵的行或列补全

IF 1.5 2区数学 Q2 MATHEMATICS, APPLIED

SIAM Journal on Matrix Analysis and Applications

Pub Date : 2024-02-07 DOI: 10.1137/23m1564547

Agurtzane Amparan, Itziar Baragaña, Silvia Marcaida, Alicia Roca

SIAM Journal on Matrix Analysis and Applications, Volume 45, Issue 1, Page 478-503, March 2024.
Abstract. We solve the problem of characterizing the existence of a polynomial matrix of fixed degree when its eigenstructure (or part of it) and some of its rows (columns) are prescribed. More specifically, we present a solution to the row (column) completion problem of a polynomial matrix of given degree under different prescribed invariants: the whole eigenstructure, all of it but the row (column) minimal indices, and the finite and/or infinite structures. Moreover, we characterize the existence of a polynomial matrix with prescribed degree and eigenstructure over an arbitrary field.

SIAM 矩阵分析与应用期刊》，第 45 卷，第 1 期，第 478-503 页，2024 年 3 月。摘要我们解决了当一个固定阶数的多项式矩阵的特征结构（或部分特征结构）及其部分行（列）被规定时，该矩阵的存在性问题。更具体地说，我们提出了给定度数的多项式矩阵在不同规定不变式下行（列）补全问题的解决方案：整个特征结构、除行（列）最小索引外的所有特征结构以及有限和/或无限结构。此外，我们还描述了任意域上具有规定度和特征结构的多项式矩阵的存在性。

引用次数: 0

Communication Lower Bounds and Optimal Algorithms for Multiple Tensor-Times-Matrix Computation 多重张量-时间-矩阵计算的通信下限和最优算法

IF 1.5 2区数学 Q2 MATHEMATICS, APPLIED

SIAM Journal on Matrix Analysis and Applications

Pub Date : 2024-02-06 DOI: 10.1137/22m1510443

Hussam Al Daas, Grey Ballard, Laura Grigori, Suraj Kumar, Kathryn Rouse

SIAM Journal on Matrix Analysis and Applications, Volume 45, Issue 1, Page 450-477, March 2024.
Abstract. Multiple tensor-times-matrix (Multi-TTM) is a key computation in algorithms for computing and operating with the Tucker tensor decomposition, which is frequently used in multidimensional data analysis. We establish communication lower bounds that determine how much data movement is required (under mild conditions) to perform the Multi-TTM computation in parallel. The crux of the proof relies on analytically solving a constrained, nonlinear optimization problem. We also present a parallel algorithm to perform this computation that organizes the processors into a logical grid with twice as many modes as the input tensor. We show that, with correct choices of grid dimensions, the communication cost of the algorithm attains the lower bounds and is therefore communication optimal. Finally, we show that our algorithm can significantly reduce communication compared to the straightforward approach of expressing the computation as a sequence of tensor-times-matrix operations when the input and output tensors vary greatly in size.

SIAM 矩阵分析与应用期刊》，第 45 卷，第 1 期，第 450-477 页，2024 年 3 月。摘要。多重张量-时间-矩阵（Multi-TTM）是计算和操作塔克张量分解算法中的一项关键计算，常用于多维数据分析。我们建立了通信下限，确定了并行执行 Multi-TTM 计算所需的数据移动量（在温和条件下）。证明的关键在于分析求解一个有约束的非线性优化问题。我们还提出了一种执行该计算的并行算法，该算法将处理器组织成一个逻辑网格，其模式数量是输入张量的两倍。我们证明，只要正确选择网格维数，算法的通信成本就能达到下限，因此是通信最优的。最后，我们证明，当输入和输出张量的大小相差很大时，与直接将计算表达为张量-时间-矩阵运算序列的方法相比，我们的算法可以显著减少通信量。

{"title":"Communication Lower Bounds and Optimal Algorithms for Multiple Tensor-Times-Matrix Computation","authors":"Hussam Al Daas, Grey Ballard, Laura Grigori, Suraj Kumar, Kathryn Rouse","doi":"10.1137/22m1510443","DOIUrl":"https://doi.org/10.1137/22m1510443","url":null,"abstract":"SIAM Journal on Matrix Analysis and Applications, Volume 45, Issue 1, Page 450-477, March 2024. Abstract. Multiple tensor-times-matrix (Multi-TTM) is a key computation in algorithms for computing and operating with the Tucker tensor decomposition, which is frequently used in multidimensional data analysis. We establish communication lower bounds that determine how much data movement is required (under mild conditions) to perform the Multi-TTM computation in parallel. The crux of the proof relies on analytically solving a constrained, nonlinear optimization problem. We also present a parallel algorithm to perform this computation that organizes the processors into a logical grid with twice as many modes as the input tensor. We show that, with correct choices of grid dimensions, the communication cost of the algorithm attains the lower bounds and is therefore communication optimal. Finally, we show that our algorithm can significantly reduce communication compared to the straightforward approach of expressing the computation as a sequence of tensor-times-matrix operations when the input and output tensors vary greatly in size.","PeriodicalId":49538,"journal":{"name":"SIAM Journal on Matrix Analysis and Applications","volume":"23 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139756471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Backpropagation through Back Substitution with a Backslash 通过带反斜线的反向置换进行反向传播

IF 1.5 2区数学 Q2 MATHEMATICS, APPLIED

SIAM Journal on Matrix Analysis and Applications

Pub Date : 2024-02-05 DOI: 10.1137/22m1532871

Alan Edelman, Ekin Akyürek, Yuyang Wang

SIAM Journal on Matrix Analysis and Applications, Volume 45, Issue 1, Page 429-449, March 2024.
Abstract. We present a linear algebra formulation of backpropagation which allows the calculation of gradients by using a generically written “backslash” or Gaussian elimination on triangular systems of equations. Generally, the matrix elements are operators. This paper has three contributions: (i) it is of intellectual value to replace traditional treatments of automatic differentiation with a (left acting) operator theoretic, graph-based approach; (ii) operators can be readily placed in matrices in software in programming languages such as Julia as an implementation option; (iii) we introduce a novel notation, “transpose dot” operator “[math]” that allows for the reversal of operators. We further demonstrate the elegance of the operators approach in a suitable programming language consisting of generic linear algebra operators such as Julia [Bezanson et al., SIAM Rev., 59 (2017), pp. 65–98], and that it is possible to realize this abstraction in code. Our implementation shows how generic linear algebra can allow operators as elements of matrices. In contrast to “operator overloading,” where backslash would normally have to be rewritten to take advantage of operators, with “generic programming” there is no such need.

SIAM 矩阵分析与应用期刊》，第 45 卷，第 1 期，第 429-449 页，2024 年 3 月。摘要。我们提出了一种反向传播的线性代数公式，它允许在三角方程组中使用通用的 "反斜线 "或高斯消元法计算梯度。一般来说，矩阵元素都是算子。本文有三方面的贡献：(i) 用（左演算）算子理论、基于图的方法取代传统的自动微分处理方法，具有重要的思想价值；(ii) 作为一种实现方案，算子可以很容易地放在软件的矩阵中，如 Julia 等编程语言；(iii) 我们引入了一种新颖的符号，即 "转置点 "算子"[math]"，允许算子反转。我们进一步证明，在由通用线性代数算子组成的合适编程语言（如 Julia）中，算子方法是优雅的[Bezanson 等人，SIAM Rev.，59 (2017)，第 65-98 页]，而且有可能在代码中实现这种抽象。我们的实现展示了通用线性代数如何允许算子作为矩阵的元素。与 "运算符重载 "相比，"泛型编程 "通常需要重写反斜线才能利用运算符的优势，而 "泛型编程 "则没有这种必要。

{"title":"Backpropagation through Back Substitution with a Backslash","authors":"Alan Edelman, Ekin Akyürek, Yuyang Wang","doi":"10.1137/22m1532871","DOIUrl":"https://doi.org/10.1137/22m1532871","url":null,"abstract":"SIAM Journal on Matrix Analysis and Applications, Volume 45, Issue 1, Page 429-449, March 2024. Abstract. We present a linear algebra formulation of backpropagation which allows the calculation of gradients by using a generically written “backslash” or Gaussian elimination on triangular systems of equations. Generally, the matrix elements are operators. This paper has three contributions: (i) it is of intellectual value to replace traditional treatments of automatic differentiation with a (left acting) operator theoretic, graph-based approach; (ii) operators can be readily placed in matrices in software in programming languages such as Julia as an implementation option; (iii) we introduce a novel notation, “transpose dot” operator “[math]” that allows for the reversal of operators. We further demonstrate the elegance of the operators approach in a suitable programming language consisting of generic linear algebra operators such as Julia [Bezanson et al., SIAM Rev., 59 (2017), pp. 65–98], and that it is possible to realize this abstraction in code. Our implementation shows how generic linear algebra can allow operators as elements of matrices. In contrast to “operator overloading,” where backslash would normally have to be rewritten to take advantage of operators, with “generic programming” there is no such need.","PeriodicalId":49538,"journal":{"name":"SIAM Journal on Matrix Analysis and Applications","volume":"15 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139756458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

More on Tensors with Different Rank and Symmetric Rank 关于不同秩和对称秩的张量的更多信息

IF 1.5 2区数学 Q2 MATHEMATICS, APPLIED

SIAM Journal on Matrix Analysis and Applications

Pub Date : 2024-02-05 DOI: 10.1137/23m1547159

Yaroslav Shitov

SIAM Journal on Matrix Analysis and Applications, Volume 45, Issue 1, Page 419-428, March 2024.
Abstract. This is a further discussion of a previous work of the author on tensors with different rank and symmetric rank. We point out several obstructions towards extending a complex number example to the real number setting and discuss several further questions raised in the literature.

SIAM 矩阵分析与应用期刊》，第 45 卷，第 1 期，第 419-428 页，2024 年 3 月。摘要。这是对作者以前关于不同秩和对称秩张量的工作的进一步讨论。我们指出了将复数例子推广到实数环境的几个障碍，并讨论了文献中提出的几个进一步的问题。

引用次数: 0

Weighted Enumeration of Nonbacktracking Walks on Weighted Graphs 加权图上非回溯行走的加权枚举

IF 1.5 2区数学 Q2 MATHEMATICS, APPLIED

SIAM Journal on Matrix Analysis and Applications

Pub Date : 2024-01-30 DOI: 10.1137/23m155219x

Francesca Arrigo, Desmond J. Higham, Vanni Noferini, Ryan Wood

SIAM Journal on Matrix Analysis and Applications, Volume 45, Issue 1, Page 397-418, March 2024.
Abstract. We extend the notion of nonbacktracking walks from unweighted graphs to graphs whose edges have a nonnegative weight. Here the weight associated with a walk is taken to be the product over the weights along the individual edges. We give two ways to compute the associated generating function, and corresponding node centrality measures. One method works directly on the original graph and one uses a line graph construction followed by a projection. The first method is more efficient, but the second has the advantage of extending naturally to time-evolving graphs. Based on these generating functions, we define and study corresponding centrality measures. Illustrative computational results are also provided.

SIAM 矩阵分析与应用期刊》，第 45 卷，第 1 期，第 397-418 页，2024 年 3 月。摘要我们将无权重图中的非回溯走行概念扩展到边具有非负权重的图。在这里，与走行相关的权重是各条边权重的乘积。我们给出了两种计算相关生成函数和相应节点中心度量的方法。一种方法是直接计算原始图，另一种方法是先构建线图，然后进行投影。第一种方法更有效，但第二种方法的优点是可以自然扩展到时间演化图。基于这些生成函数，我们定义并研究了相应的中心性度量。我们还提供了示例性计算结果。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

SIAM Journal on Matrix Analysis and Applications

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀