Distributed LSI: Parallel preprocessing and vector sharing
R. Bradford
2015 IEEE International Conference on Intelligence and Security Informatics (ISI), published 2015-05-27
DOI: https://doi.org/10.1109/ISI.2015.7165973
Citation count: 0
Abstract
The technique of latent semantic indexing (LSI) has a wide variety of uses in intelligence and security informatics applications. LSI processing generates high-dimensional vectors that are used to represent individual items of interest and the features of which those items are composed. Historically, LSI representation vectors have been generated in a single computing environment (workstation, server, or VM instance). However, this is not a requirement. This paper describes two approaches to distributing elements of LSI processing. The first, parallelization of the preprocessing stage, can significantly decrease the time required for creation of LSI indexes. The second, vector sharing, can dramatically improve security in distributed LSI environments.
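The abstract does not say how the preprocessing stage is parallelized, so the following is only a minimal sketch of one plausible reading: documents are split into shards, each shard is tokenized and counted independently, and the partial results are merged into a term-document count matrix that a later LSI step (the truncated SVD, not shown) could consume. The function names, the tokenizer, and the sharding scheme are all assumptions made for illustration and are not taken from the paper.

```python
import re
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

# Hypothetical tokenizer: lowercase alphanumeric runs. Real LSI pipelines
# usually add stop-word removal, phrase handling, and term weighting.
TOKEN = re.compile(r"[a-z0-9]+")


def count_terms(shard):
    """Turn one shard of (doc_id, text) pairs into per-document term counts."""
    return {doc_id: Counter(TOKEN.findall(text.lower())) for doc_id, text in shard}


def preprocess_parallel(docs, n_shards=4, executor=None):
    """Count terms shard by shard through an executor, then merge the results.

    A thread pool is the default so the sketch runs anywhere; for CPU-bound
    tokenization a process pool or separate machines would be the assumed
    substitute, since each shard needs no data from any other shard.
    """
    items = list(docs.items())
    shards = [items[i::n_shards] for i in range(n_shards)]
    owns_executor = executor is None
    if owns_executor:
        executor = ThreadPoolExecutor(max_workers=n_shards)
    try:
        merged = {}
        for partial in executor.map(count_terms, shards):
            merged.update(partial)  # document ids are disjoint across shards
    finally:
        if owns_executor:
            executor.shutdown()
    return merged


def term_document_matrix(counts):
    """Build a dense terms-by-documents count matrix from the merged counts."""
    vocab = sorted({term for c in counts.values() for term in c})
    doc_ids = sorted(counts)
    matrix = [[counts[d][t] for d in doc_ids] for t in vocab]
    return vocab, doc_ids, matrix
```

Because each shard is processed independently, the only serial work in this sketch is the final merge and matrix assembly, which is why a preprocessing stage of this kind lends itself to distribution.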