{"title":"PAMDI: Privacy aware missing data inference scheme for sparse mobile crowd sensing","authors":"Tejendrakumar Thakur, N. Marchang","doi":"10.3233/ais-220475","DOIUrl":null,"url":null,"abstract":"The ubiquity of mobile devices has birthed one of the most promising IoT applications called Mobile Crowd Sensing (MCS) wherein mobile devices carried around by a crowd are used to sense phenomena of interest. Subsequently, sensed data are collected, aggregated and analysed to extract useful information. Sparse Mobile Crowd Sensing (SMCS) aims at reducing the sensing overhead (e.g., battery consumption, incentive cost, etc.) by lowering the number of sensing tasks performed. Sensed data thus collected are used to infer missing values. However, it must be ensured that user’s private information (e.g., user’s home location) cannot be derived from the sensed data shared by a user. We propose a novel approach entitled ‘Privacy Aware Missing Data Inference Scheme for Sparse Mobile Crowd Sensing (PAMDI)’ which employs the concept of perceptual hash for ensuring privacy while trying to maintain performance guarantees. Simulation results with the help of two real-world data-sets point towards the feasibility of the proposed approach for provisioning user privacy. We use regression algorithms for missing data inference in PAMDI and find that linear regression algorithms work best with the proposed privacy approach as compared to non-linear regression algorithms. Moreover, we observe that inference accuracy is more or less maintained even after introducing privacy with the proposed approach. In particular, for the first data-set (Temperature data-set), the mean absolute error (MAE) and root mean squared error (RMSE) values obtained by the linear algorithms using the proposed approach are about 2.65 ∘ C and 2 . 9 ∘ C respectively. On the other hand, the corresponding MAE and RMSE values generated by the linear algorithms when no privacy is introduced are about 2.25 ∘ C and 2.85 ∘ C respectively. For non-linear algorithms, the corresponding error values are higher. We also observe the same trend in the results of the second data-set.","PeriodicalId":49316,"journal":{"name":"Journal of Ambient Intelligence and Smart Environments","volume":"62 1","pages":"19-46"},"PeriodicalIF":1.8000,"publicationDate":"2023-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Ambient Intelligence and Smart Environments","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.3233/ais-220475","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
The ubiquity of mobile devices has birthed one of the most promising IoT applications called Mobile Crowd Sensing (MCS) wherein mobile devices carried around by a crowd are used to sense phenomena of interest. Subsequently, sensed data are collected, aggregated and analysed to extract useful information. Sparse Mobile Crowd Sensing (SMCS) aims at reducing the sensing overhead (e.g., battery consumption, incentive cost, etc.) by lowering the number of sensing tasks performed. Sensed data thus collected are used to infer missing values. However, it must be ensured that user’s private information (e.g., user’s home location) cannot be derived from the sensed data shared by a user. We propose a novel approach entitled ‘Privacy Aware Missing Data Inference Scheme for Sparse Mobile Crowd Sensing (PAMDI)’ which employs the concept of perceptual hash for ensuring privacy while trying to maintain performance guarantees. Simulation results with the help of two real-world data-sets point towards the feasibility of the proposed approach for provisioning user privacy. We use regression algorithms for missing data inference in PAMDI and find that linear regression algorithms work best with the proposed privacy approach as compared to non-linear regression algorithms. Moreover, we observe that inference accuracy is more or less maintained even after introducing privacy with the proposed approach. In particular, for the first data-set (Temperature data-set), the mean absolute error (MAE) and root mean squared error (RMSE) values obtained by the linear algorithms using the proposed approach are about 2.65 ∘ C and 2 . 9 ∘ C respectively. On the other hand, the corresponding MAE and RMSE values generated by the linear algorithms when no privacy is introduced are about 2.25 ∘ C and 2.85 ∘ C respectively. For non-linear algorithms, the corresponding error values are higher. We also observe the same trend in the results of the second data-set.
期刊介绍:
The Journal of Ambient Intelligence and Smart Environments (JAISE) serves as a forum to discuss the latest developments on Ambient Intelligence (AmI) and Smart Environments (SmE). Given the multi-disciplinary nature of the areas involved, the journal aims to promote participation from several different communities covering topics ranging from enabling technologies such as multi-modal sensing and vision processing, to algorithmic aspects in interpretive and reasoning domains, to application-oriented efforts in human-centered services, as well as contributions from the fields of robotics, networking, HCI, mobile, collaborative and pervasive computing. This diversity stems from the fact that smart environments can be defined with a variety of different characteristics based on the applications they serve, their interaction models with humans, the practical system design aspects, as well as the multi-faceted conceptual and algorithmic considerations that would enable them to operate seamlessly and unobtrusively. The Journal of Ambient Intelligence and Smart Environments will focus on both the technical and application aspects of these.