{"title":"基于分区的缺失数据流匿名化增量边缘化算法","authors":"Ankhbayar Otgonbayar, Zeeshan Pervez, K. Dahal","doi":"10.1109/SKIMA47702.2019.8982399","DOIUrl":null,"url":null,"abstract":"The IoT and its applications are the inseparable part of modern world. IoT is expanding into every corner of the world where internet is available. IoT data streams are utilized by many organizations for research and business. To benefit from these data streams, the data handling party must secure the individuals’ privacy. The most common privacy preservation approach is data anonymization. However, IoT data provides missing data streams due to the varying device pool and preferences of individuals and unpredicted devices’ malfunctions of IoT. Minimization of missingess and information loss is very important for anonymizing of missing data streams. To achieve this, we introduce IncrementalPBM (Incremental Partitioning Based Marginalization) for anonymizing missig data streams. IncrementalPBM utilizes time based sliding window for missing data stream anonymization, and it aims to control the number of QIDs for anonymization while increasing the number of tuples for anonymization. Our experiment on real dataset showed IncrementalPBM is effective and efficient for anonymizing missing data streams compared to existing missing data stream anonymization algorithm. IncrementalPBM showed significant improvement; 5% to 9% less information loss, 4500 to 6000 more number of re-use anonymization while showing comparable clustering, suppression and runtime.","PeriodicalId":245523,"journal":{"name":"2019 13th International Conference on Software, Knowledge, Information Management and Applications (SKIMA)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Partitioning based incremental marginalization algorithm for anonymizing missing data streams\",\"authors\":\"Ankhbayar Otgonbayar, Zeeshan Pervez, K. Dahal\",\"doi\":\"10.1109/SKIMA47702.2019.8982399\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The IoT and its applications are the inseparable part of modern world. IoT is expanding into every corner of the world where internet is available. IoT data streams are utilized by many organizations for research and business. To benefit from these data streams, the data handling party must secure the individuals’ privacy. The most common privacy preservation approach is data anonymization. However, IoT data provides missing data streams due to the varying device pool and preferences of individuals and unpredicted devices’ malfunctions of IoT. Minimization of missingess and information loss is very important for anonymizing of missing data streams. To achieve this, we introduce IncrementalPBM (Incremental Partitioning Based Marginalization) for anonymizing missig data streams. IncrementalPBM utilizes time based sliding window for missing data stream anonymization, and it aims to control the number of QIDs for anonymization while increasing the number of tuples for anonymization. Our experiment on real dataset showed IncrementalPBM is effective and efficient for anonymizing missing data streams compared to existing missing data stream anonymization algorithm. IncrementalPBM showed significant improvement; 5% to 9% less information loss, 4500 to 6000 more number of re-use anonymization while showing comparable clustering, suppression and runtime.\",\"PeriodicalId\":245523,\"journal\":{\"name\":\"2019 13th International Conference on Software, Knowledge, Information Management and Applications (SKIMA)\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 13th International Conference on Software, Knowledge, Information Management and Applications (SKIMA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SKIMA47702.2019.8982399\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 13th International Conference on Software, Knowledge, Information Management and Applications (SKIMA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SKIMA47702.2019.8982399","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Partitioning based incremental marginalization algorithm for anonymizing missing data streams
The IoT and its applications are the inseparable part of modern world. IoT is expanding into every corner of the world where internet is available. IoT data streams are utilized by many organizations for research and business. To benefit from these data streams, the data handling party must secure the individuals’ privacy. The most common privacy preservation approach is data anonymization. However, IoT data provides missing data streams due to the varying device pool and preferences of individuals and unpredicted devices’ malfunctions of IoT. Minimization of missingess and information loss is very important for anonymizing of missing data streams. To achieve this, we introduce IncrementalPBM (Incremental Partitioning Based Marginalization) for anonymizing missig data streams. IncrementalPBM utilizes time based sliding window for missing data stream anonymization, and it aims to control the number of QIDs for anonymization while increasing the number of tuples for anonymization. Our experiment on real dataset showed IncrementalPBM is effective and efficient for anonymizing missing data streams compared to existing missing data stream anonymization algorithm. IncrementalPBM showed significant improvement; 5% to 9% less information loss, 4500 to 6000 more number of re-use anonymization while showing comparable clustering, suppression and runtime.