{"title":"SJSON: A succinct representation for JavaScript object notation documents","authors":"Edman Anjos, Junhee Lee, S. Rao","doi":"10.1109/ICDIM.2016.7829787","DOIUrl":null,"url":null,"abstract":"The massive amounts of data processed in modern computational systems is becoming a problem of increasing importance. This data is commonly stored directly or indirectly through the use of data exchange languages, such as JavaScript Object Notation (JSON), for human-readable platform agnostic access. This paper focuses on describing and analyzing SJSON, a library that explores succinct representations of JSON documents as a means to achieve reduced memory usage of files in main memory, and to permit the compression of JSON files stored in disk. In SJSON we represent the document structure with succinct trees, as opposed to the usual pointer-based implementation. Furthermore, the remaining raw data are organized in arrays of attributes and values. Attributes are stripped of redundancies and stored in a simple contiguous array, while values are represented through a bit string indexed array. The scheme here proposed is then evaluated with respect to a number of metrics comparing its performance with popular libraries, anssd possible improvements to the representation are then presented.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDIM.2016.7829787","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
The massive amounts of data processed in modern computational systems is becoming a problem of increasing importance. This data is commonly stored directly or indirectly through the use of data exchange languages, such as JavaScript Object Notation (JSON), for human-readable platform agnostic access. This paper focuses on describing and analyzing SJSON, a library that explores succinct representations of JSON documents as a means to achieve reduced memory usage of files in main memory, and to permit the compression of JSON files stored in disk. In SJSON we represent the document structure with succinct trees, as opposed to the usual pointer-based implementation. Furthermore, the remaining raw data are organized in arrays of attributes and values. Attributes are stripped of redundancies and stored in a simple contiguous array, while values are represented through a bit string indexed array. The scheme here proposed is then evaluated with respect to a number of metrics comparing its performance with popular libraries, anssd possible improvements to the representation are then presented.