Tugba Torun, Eren Yenigul, Ameer Taweel, Didem Unat
{"title":"A Sparse Tensor Generator with Efficient Feature Extraction","authors":"Tugba Torun, Eren Yenigul, Ameer Taweel, Didem Unat","doi":"arxiv-2405.04944","DOIUrl":null,"url":null,"abstract":"Sparse tensor operations are gaining attention in emerging applications such\nas social networks, deep learning, diagnosis, crime, and review analysis.\nHowever, a major obstacle for research in sparse tensor operations is the\ndeficiency of a broad-scale sparse tensor dataset. Another challenge in sparse\ntensor operations is examining the sparse tensor features, which are not only\nimportant for revealing its nonzero pattern but also have a significant impact\non determining the best-suited storage format, the decomposition algorithm, and\nthe reordering methods. However, due to the large sizes of real tensors, even\nextracting these features becomes costly without caution. To address these gaps\nin the literature, we have developed a smart sparse tensor generator that\nmimics the substantial features of real sparse tensors. Moreover, we propose\nvarious methods for efficiently extracting an extensive set of features for\nsparse tensors. The effectiveness of our generator is validated through the\nquality of features and the performance of decomposition in the generated\ntensors. Both the sparse tensor feature extractor and the tensor generator are\nopen source with all the artifacts available at\nhttps://github.com/sparcityeu/feaTen and https://github.com/sparcityeu/genTen,\nrespectively.","PeriodicalId":501256,"journal":{"name":"arXiv - CS - Mathematical Software","volume":"58 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Mathematical Software","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.04944","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Sparse tensor operations are gaining attention in emerging applications such
as social networks, deep learning, diagnosis, crime, and review analysis.
However, a major obstacle for research in sparse tensor operations is the
deficiency of a broad-scale sparse tensor dataset. Another challenge in sparse
tensor operations is examining the sparse tensor features, which are not only
important for revealing its nonzero pattern but also have a significant impact
on determining the best-suited storage format, the decomposition algorithm, and
the reordering methods. However, due to the large sizes of real tensors, even
extracting these features becomes costly without caution. To address these gaps
in the literature, we have developed a smart sparse tensor generator that
mimics the substantial features of real sparse tensors. Moreover, we propose
various methods for efficiently extracting an extensive set of features for
sparse tensors. The effectiveness of our generator is validated through the
quality of features and the performance of decomposition in the generated
tensors. Both the sparse tensor feature extractor and the tensor generator are
open source with all the artifacts available at
https://github.com/sparcityeu/feaTen and https://github.com/sparcityeu/genTen,
respectively.