J. Kon, Gil Jae Lee, J. Fortes, Saneyasu Yamaguchi
{"title":"A Kernel-Based Method for Resolving Performance Inefficiencies in Mining Frequent-Patterns in Encrypted Data","authors":"J. Kon, Gil Jae Lee, J. Fortes, Saneyasu Yamaguchi","doi":"10.1109/CANDARW.2018.00098","DOIUrl":null,"url":null,"abstract":"Big-data analytics is increasingly important in today's data-centric world. In this context, data encryption is a powerful tool for storing and analyzing private data. In particular, fully homomorphic encryption (FHE) is a promising encryption method that allows the analysis of encrypted data without need for decryption. FHE therefore enables users to outsource data storage and processing to a public cloud system without disclosing their data. However, FHE significantly increases data size and processing time, thus making it essential to improve performance in both I/O and processing. In most cases, the behavior of CPU resource consumption can be monitored and understood from code structure and logic. On the contrary, I/O resource consumption, which is controlled by the operating system kernel, is much harder to observe and understand. This paper addresses this issue in the context of a widely used data-analytics technique for secure frequent-pattern mining, called FHE Apriori. First, we propose a method for observing the I/O requests of FHE Apriori by modifying the operating system kernel. Second, we use the proposed method to characterize the I/O behavior of FHE Apriori and identify inefficiencies of storage access (that can be addressed to improve performance). Third, application-level changes based on this identification are described, enabling prefetching of data at runtime before they are needed. Fourth, the benefit of the described changes is quantitatively evaluated, showing that application performance improves by 23%.","PeriodicalId":329439,"journal":{"name":"2018 Sixth International Symposium on Computing and Networking Workshops (CANDARW)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Sixth International Symposium on Computing and Networking Workshops (CANDARW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CANDARW.2018.00098","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Big-data analytics is increasingly important in today's data-centric world. In this context, data encryption is a powerful tool for storing and analyzing private data. In particular, fully homomorphic encryption (FHE) is a promising encryption method that allows the analysis of encrypted data without need for decryption. FHE therefore enables users to outsource data storage and processing to a public cloud system without disclosing their data. However, FHE significantly increases data size and processing time, thus making it essential to improve performance in both I/O and processing. In most cases, the behavior of CPU resource consumption can be monitored and understood from code structure and logic. On the contrary, I/O resource consumption, which is controlled by the operating system kernel, is much harder to observe and understand. This paper addresses this issue in the context of a widely used data-analytics technique for secure frequent-pattern mining, called FHE Apriori. First, we propose a method for observing the I/O requests of FHE Apriori by modifying the operating system kernel. Second, we use the proposed method to characterize the I/O behavior of FHE Apriori and identify inefficiencies of storage access (that can be addressed to improve performance). Third, application-level changes based on this identification are described, enabling prefetching of data at runtime before they are needed. Fourth, the benefit of the described changes is quantitatively evaluated, showing that application performance improves by 23%.