{"title":"svds-C:计算截断奇异值分解的多线程 C 代码","authors":"Xu Feng, Wenjian Yu, Yuyang Xie","doi":"arxiv-2405.18966","DOIUrl":null,"url":null,"abstract":"This article presents svds-C, an open-source and high-performance C program\nfor accurately and robustly computing truncated SVD, e.g. computing several\nlargest singular values and corresponding singular vectors. We have\nre-implemented the algorithm of svds in Matlab in C based on MKL or OpenBLAS\nand multi-thread computing to obtain the parallel program named svds-C. svds-C\nrunning on shared-memory computer consumes less time and memory than svds\nthanks to careful implementation of multi-thread parallelization and memory\nmanagement. Numerical experiments on different test cases which are\nsynthetically generated or directly from real world datasets show that, svds-C\nruns remarkably faster than svds with averagely 4.7X and at most 12X speedup\nfor 16-thread parallel computing on a computer with Intel CPU, while preserving\nsame accuracy and consuming about half memory space. Experimental results also\ndemonstrate that svds-C has similar advantages over svds on the computer with\nAMD CPU, and outperforms other state-of-the-art algorithms for truncated SVD on\ncomputing time and robustness.","PeriodicalId":501256,"journal":{"name":"arXiv - CS - Mathematical Software","volume":"34 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"svds-C: A Multi-Thread C Code for Computing Truncated Singular Value Decomposition\",\"authors\":\"Xu Feng, Wenjian Yu, Yuyang Xie\",\"doi\":\"arxiv-2405.18966\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This article presents svds-C, an open-source and high-performance C program\\nfor accurately and robustly computing truncated SVD, e.g. computing several\\nlargest singular values and corresponding singular vectors. We have\\nre-implemented the algorithm of svds in Matlab in C based on MKL or OpenBLAS\\nand multi-thread computing to obtain the parallel program named svds-C. svds-C\\nrunning on shared-memory computer consumes less time and memory than svds\\nthanks to careful implementation of multi-thread parallelization and memory\\nmanagement. Numerical experiments on different test cases which are\\nsynthetically generated or directly from real world datasets show that, svds-C\\nruns remarkably faster than svds with averagely 4.7X and at most 12X speedup\\nfor 16-thread parallel computing on a computer with Intel CPU, while preserving\\nsame accuracy and consuming about half memory space. Experimental results also\\ndemonstrate that svds-C has similar advantages over svds on the computer with\\nAMD CPU, and outperforms other state-of-the-art algorithms for truncated SVD on\\ncomputing time and robustness.\",\"PeriodicalId\":501256,\"journal\":{\"name\":\"arXiv - CS - Mathematical Software\",\"volume\":\"34 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Mathematical Software\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2405.18966\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Mathematical Software","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.18966","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
本文介绍的 svds-C 是一个开源的高性能 C 程序,用于精确、稳健地计算截断 SVD,例如计算几个最大奇异值和相应的奇异向量。由于精心实现了多线程并行化和内存管理,在共享内存计算机上运行的svds-C比svdst消耗更少的时间和内存。在不同测试用例上进行的数值实验表明,svds-C 运行速度明显快于 svds,在英特尔 CPU 计算机上进行 16 线程并行计算时,平均速度提高了 4.7 倍,最多提高了 12 倍,同时保持了相同的精度,内存空间消耗约为 svds 的一半。实验结果还证明,在使用 AMD CPU 的计算机上,svds-C 与 svds 相比具有类似的优势,并且在计算时间和鲁棒性方面优于其他最先进的截断 SVD 算法。
svds-C: A Multi-Thread C Code for Computing Truncated Singular Value Decomposition
This article presents svds-C, an open-source and high-performance C program
for accurately and robustly computing truncated SVD, e.g. computing several
largest singular values and corresponding singular vectors. We have
re-implemented the algorithm of svds in Matlab in C based on MKL or OpenBLAS
and multi-thread computing to obtain the parallel program named svds-C. svds-C
running on shared-memory computer consumes less time and memory than svds
thanks to careful implementation of multi-thread parallelization and memory
management. Numerical experiments on different test cases which are
synthetically generated or directly from real world datasets show that, svds-C
runs remarkably faster than svds with averagely 4.7X and at most 12X speedup
for 16-thread parallel computing on a computer with Intel CPU, while preserving
same accuracy and consuming about half memory space. Experimental results also
demonstrate that svds-C has similar advantages over svds on the computer with
AMD CPU, and outperforms other state-of-the-art algorithms for truncated SVD on
computing time and robustness.