{"title":"Improving compressed matrix multiplication using control variate method","authors":"Bhisham Dev Verma , Punit Pankaj Dubey , Rameshwar Pratap , Manoj Thakur","doi":"10.1016/j.ipl.2024.106517","DOIUrl":null,"url":null,"abstract":"<div><p>The seminal work by Pagh <span>[1]</span> proposed a matrix multiplication algorithm for real-valued squared matrices called Compressed Matrix Multiplication (CMM) having a sparse matrix output product. The algorithm is based on a popular sketching technique called Count-Sketch <span>[2]</span> and Fast Fourier Transform (FFT). For input square matrices <strong>A</strong> and <strong>B</strong> of order <em>n</em> and the product matrix <strong>AB</strong> with Frobenius norm <span><math><mo>|</mo><mo>|</mo><mrow><mi>AB</mi></mrow><mo>|</mo><msub><mrow><mo>|</mo></mrow><mrow><mi>F</mi></mrow></msub></math></span>, the algorithm offers an unbiased estimate for each entry, <em>i.e.</em>, <span><math><msub><mrow><mo>(</mo><mrow><mi>AB</mi></mrow><mo>)</mo></mrow><mrow><mi>i</mi><mo>,</mo><mi>j</mi></mrow></msub></math></span> of the product matrix <strong>AB</strong> with a variance bounded by <span><math><mo>|</mo><mo>|</mo><mrow><mi>AB</mi></mrow><mo>|</mo><msubsup><mrow><mo>|</mo></mrow><mrow><mi>F</mi></mrow><mrow><mn>2</mn></mrow></msubsup><mo>/</mo><mi>b</mi></math></span>, where <em>b</em> is the compressed bucket size. Thus, the variance will eventually become high for a small bucket size. In this work, we address the high variance problem of CMM with the help of a simple and practical technique based on classical variance reduction methods in statistics. Our techniques rely on the Control Variate (CV) method. We suggest rigorous theoretical analysis for variance reduction and complement it via supporting empirical evidence.</p></div>","PeriodicalId":56290,"journal":{"name":"Information Processing Letters","volume":"187 ","pages":"Article 106517"},"PeriodicalIF":0.7000,"publicationDate":"2024-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing Letters","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020019024000474","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The seminal work by Pagh [1] proposed a matrix multiplication algorithm for real-valued squared matrices called Compressed Matrix Multiplication (CMM) having a sparse matrix output product. The algorithm is based on a popular sketching technique called Count-Sketch [2] and Fast Fourier Transform (FFT). For input square matrices A and B of order n and the product matrix AB with Frobenius norm , the algorithm offers an unbiased estimate for each entry, i.e., of the product matrix AB with a variance bounded by , where b is the compressed bucket size. Thus, the variance will eventually become high for a small bucket size. In this work, we address the high variance problem of CMM with the help of a simple and practical technique based on classical variance reduction methods in statistics. Our techniques rely on the Control Variate (CV) method. We suggest rigorous theoretical analysis for variance reduction and complement it via supporting empirical evidence.
期刊介绍:
Information Processing Letters invites submission of original research articles that focus on fundamental aspects of information processing and computing. This naturally includes work in the broadly understood field of theoretical computer science; although papers in all areas of scientific inquiry will be given consideration, provided that they describe research contributions credibly motivated by applications to computing and involve rigorous methodology. High quality experimental papers that address topics of sufficiently broad interest may also be considered.
Since its inception in 1971, Information Processing Letters has served as a forum for timely dissemination of short, concise and focused research contributions. Continuing with this tradition, and to expedite the reviewing process, manuscripts are generally limited in length to nine pages when they appear in print.