{"title":"svds-C: A Multi-Thread C Code for Computing Truncated Singular Value Decomposition","authors":"Xu Feng, Wenjian Yu, Yuyang Xie","doi":"arxiv-2405.18966","DOIUrl":null,"url":null,"abstract":"This article presents svds-C, an open-source and high-performance C program\nfor accurately and robustly computing truncated SVD, e.g. computing several\nlargest singular values and corresponding singular vectors. We have\nre-implemented the algorithm of svds in Matlab in C based on MKL or OpenBLAS\nand multi-thread computing to obtain the parallel program named svds-C. svds-C\nrunning on shared-memory computer consumes less time and memory than svds\nthanks to careful implementation of multi-thread parallelization and memory\nmanagement. Numerical experiments on different test cases which are\nsynthetically generated or directly from real world datasets show that, svds-C\nruns remarkably faster than svds with averagely 4.7X and at most 12X speedup\nfor 16-thread parallel computing on a computer with Intel CPU, while preserving\nsame accuracy and consuming about half memory space. Experimental results also\ndemonstrate that svds-C has similar advantages over svds on the computer with\nAMD CPU, and outperforms other state-of-the-art algorithms for truncated SVD on\ncomputing time and robustness.","PeriodicalId":501256,"journal":{"name":"arXiv - CS - Mathematical Software","volume":"34 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Mathematical Software","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.18966","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This article presents svds-C, an open-source and high-performance C program
for accurately and robustly computing truncated SVD, e.g. computing several
largest singular values and corresponding singular vectors. We have
re-implemented the algorithm of svds in Matlab in C based on MKL or OpenBLAS
and multi-thread computing to obtain the parallel program named svds-C. svds-C
running on shared-memory computer consumes less time and memory than svds
thanks to careful implementation of multi-thread parallelization and memory
management. Numerical experiments on different test cases which are
synthetically generated or directly from real world datasets show that, svds-C
runs remarkably faster than svds with averagely 4.7X and at most 12X speedup
for 16-thread parallel computing on a computer with Intel CPU, while preserving
same accuracy and consuming about half memory space. Experimental results also
demonstrate that svds-C has similar advantages over svds on the computer with
AMD CPU, and outperforms other state-of-the-art algorithms for truncated SVD on
computing time and robustness.