Data streams and privacy: Two emerging issues in data classification

Radhika Kotecha, Sanjay Garg
{"title":"Data streams and privacy: Two emerging issues in data classification","authors":"Radhika Kotecha, Sanjay Garg","doi":"10.1109/NUICONE.2015.7449597","DOIUrl":null,"url":null,"abstract":"Several real-world applications generate data streams where the opportunity to examine each instance is concise. Effective classification of such data streams is an emerging issue in data mining. However, such classification can cause severe threats to privacy. There are several applications like credit card fraud detection, disease outbreak or biological attack detection, loan approval, etc. where the data is homogeneously distributed among different parties. These parties may wish to collaboratively build a classifier to obtain certain global patterns but will be reluctant to disclose their private data. Privacy-preserving classification of such homogeneously distributed data is a challenging issue too. In this paper, we present a brief review of the work carried out in data stream classification and privacy-preserving classification of homogeneously distributed data; followed by an empirical evaluation and performance comparison of some methods in both these areas. We also propose and evaluate an approach of creating an ensemble of anonymous decision trees to classify homogeneously distributed data in a privacy-preserving manner. We further identify the need to develop efficient methods for privacy-preserving classification of homogeneously distributed data streams and propose a suitable approach for the same.","PeriodicalId":131332,"journal":{"name":"2015 5th Nirma University International Conference on Engineering (NUiCONE)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 5th Nirma University International Conference on Engineering (NUiCONE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NUICONE.2015.7449597","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Several real-world applications generate data streams where the opportunity to examine each instance is concise. Effective classification of such data streams is an emerging issue in data mining. However, such classification can cause severe threats to privacy. There are several applications like credit card fraud detection, disease outbreak or biological attack detection, loan approval, etc. where the data is homogeneously distributed among different parties. These parties may wish to collaboratively build a classifier to obtain certain global patterns but will be reluctant to disclose their private data. Privacy-preserving classification of such homogeneously distributed data is a challenging issue too. In this paper, we present a brief review of the work carried out in data stream classification and privacy-preserving classification of homogeneously distributed data; followed by an empirical evaluation and performance comparison of some methods in both these areas. We also propose and evaluate an approach of creating an ensemble of anonymous decision trees to classify homogeneously distributed data in a privacy-preserving manner. We further identify the need to develop efficient methods for privacy-preserving classification of homogeneously distributed data streams and propose a suitable approach for the same.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
数据流和隐私:数据分类中的两个新问题
几个实际应用程序生成数据流,其中检查每个实例的机会很简洁。数据流的有效分类是数据挖掘中的一个新兴问题。然而,这种分类可能会对隐私造成严重威胁。有一些应用,如信用卡欺诈检测,疾病爆发或生物攻击检测,贷款审批等,其中数据均匀地分布在不同的当事方之间。这些各方可能希望协作构建一个分类器来获得某些全局模式,但不愿公开他们的私有数据。对这种均匀分布的数据进行隐私保护分类也是一个具有挑战性的问题。在本文中,我们简要回顾了在数据流分类和同质分布数据的隐私保护分类方面所做的工作;然后对这两个领域的一些方法进行了实证评价和性能比较。我们还提出并评估了一种创建匿名决策树集合的方法,以保护隐私的方式对均匀分布的数据进行分类。我们进一步确定需要开发有效的方法来保护同质分布数据流的隐私,并提出了一种合适的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Brain computer interface: A review A comparative study of various community detection algorithms in the mobile social network TCP with sender assisted delayed acknowledgement — A novel ACK thinning scheme Data streams and privacy: Two emerging issues in data classification ANFIS as a controller for fractional order system
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1