{"title":"使用浮点压缩的 H 矩阵-矢量乘法性能","authors":"Ronald Kriemann","doi":"arxiv-2405.03456","DOIUrl":null,"url":null,"abstract":"Matrix-vector multiplication forms the basis of many iterative solution\nalgorithms and as such is an important algorithm also for hierarchical\nmatrices. However, due to its low computational intensity, its performance is\ntypically limited by the available memory bandwidth. By optimizing the storage\nrepresentation of the data within such matrices, this limitation can be lifted\nand the performance increased. This applies not only to hierarchical matrices\nbut for also for other low-rank approximation schemes, e.g. block low-rank\nmatrices.","PeriodicalId":501256,"journal":{"name":"arXiv - CS - Mathematical Software","volume":"11 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Performance of H-Matrix-Vector Multiplication with Floating Point Compression\",\"authors\":\"Ronald Kriemann\",\"doi\":\"arxiv-2405.03456\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Matrix-vector multiplication forms the basis of many iterative solution\\nalgorithms and as such is an important algorithm also for hierarchical\\nmatrices. However, due to its low computational intensity, its performance is\\ntypically limited by the available memory bandwidth. By optimizing the storage\\nrepresentation of the data within such matrices, this limitation can be lifted\\nand the performance increased. This applies not only to hierarchical matrices\\nbut for also for other low-rank approximation schemes, e.g. block low-rank\\nmatrices.\",\"PeriodicalId\":501256,\"journal\":{\"name\":\"arXiv - CS - Mathematical Software\",\"volume\":\"11 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Mathematical Software\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2405.03456\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Mathematical Software","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.03456","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Performance of H-Matrix-Vector Multiplication with Floating Point Compression
Matrix-vector multiplication forms the basis of many iterative solution
algorithms and as such is an important algorithm also for hierarchical
matrices. However, due to its low computational intensity, its performance is
typically limited by the available memory bandwidth. By optimizing the storage
representation of the data within such matrices, this limitation can be lifted
and the performance increased. This applies not only to hierarchical matrices
but for also for other low-rank approximation schemes, e.g. block low-rank
matrices.