A Privacy-Preserving Solution for the Bipartite Ranking Problem

N. Faramarzi, Erman Ayday, H. Altay Güvenir
{"title":"A Privacy-Preserving Solution for the Bipartite Ranking Problem","authors":"N. Faramarzi, Erman Ayday, H. Altay Güvenir","doi":"10.1109/ICMLA.2016.0067","DOIUrl":null,"url":null,"abstract":"In this paper, we propose an efficient solution for the privacy-preserving of a bipartite ranking algorithm. The bipartite ranking problem can be considered as finding a function that ranks positive instances (in a dataset) higher than the negative ones. However, one common concern for all the existing schemes is the privacy of individuals in the dataset. That is, one (e.g., a researcher) needs to access the records of all individuals in the dataset in order to run the algorithm. This privacy concern puts limitations on the use of sensitive personal data for such analysis. The RIMARC (Ranking Instances by Maximizing Area under the ROC Curve) algorithm solves the bipartite ranking problem by learning a model to rank instances. As part of the model, it learns weights for each feature by analyzing the area under receiver operating characteristic (ROC) curve. RIMARC algorithm is shown to be more accurate and efficient than its counterparts. Thus, we use this algorithm as a building-block and provide a privacy-preserving version of the RIMARC algorithm using homomorphic encryption and secure multi-party computation. Our proposed algorithm lets a data owner outsource the storage and processing of its encrypted dataset to a semi-trusted cloud. Then, a researcher can get the results of his/her queries (to learn the ranking function) on the dataset by interacting with the cloud. During this process, neither the researcher nor the cloud learns any information about the raw dataset. We prove the security of the proposed algorithm and show its efficiency via experiments on real data.","PeriodicalId":356182,"journal":{"name":"2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2016.0067","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In this paper, we propose an efficient solution for the privacy-preserving of a bipartite ranking algorithm. The bipartite ranking problem can be considered as finding a function that ranks positive instances (in a dataset) higher than the negative ones. However, one common concern for all the existing schemes is the privacy of individuals in the dataset. That is, one (e.g., a researcher) needs to access the records of all individuals in the dataset in order to run the algorithm. This privacy concern puts limitations on the use of sensitive personal data for such analysis. The RIMARC (Ranking Instances by Maximizing Area under the ROC Curve) algorithm solves the bipartite ranking problem by learning a model to rank instances. As part of the model, it learns weights for each feature by analyzing the area under receiver operating characteristic (ROC) curve. RIMARC algorithm is shown to be more accurate and efficient than its counterparts. Thus, we use this algorithm as a building-block and provide a privacy-preserving version of the RIMARC algorithm using homomorphic encryption and secure multi-party computation. Our proposed algorithm lets a data owner outsource the storage and processing of its encrypted dataset to a semi-trusted cloud. Then, a researcher can get the results of his/her queries (to learn the ranking function) on the dataset by interacting with the cloud. During this process, neither the researcher nor the cloud learns any information about the raw dataset. We prove the security of the proposed algorithm and show its efficiency via experiments on real data.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
二部排序问题的隐私保护解
在本文中,我们提出了一个二部排序算法的隐私保护的有效解决方案。二部排序问题可以被认为是找到一个函数,该函数将数据集中的正实例排序高于负实例。然而,所有现有方案的一个共同问题是数据集中个人的隐私。也就是说,一个人(例如,研究人员)需要访问数据集中所有个人的记录才能运行算法。出于对隐私的考虑,对使用敏感个人资料进行此类分析施加了限制。RIMARC(通过最大化ROC曲线下的面积来排序实例)算法通过学习一个模型对实例进行排序来解决二部排序问题。作为模型的一部分,它通过分析接收者工作特征(ROC)曲线下的面积来学习每个特征的权重。结果表明,RIMARC算法比同类算法更准确、更高效。因此,我们使用该算法作为构建块,并使用同态加密和安全多方计算提供了RIMARC算法的隐私保护版本。我们提出的算法允许数据所有者将其加密数据集的存储和处理外包给半可信的云。然后,研究人员可以通过与云交互,在数据集上获得他/她的查询结果(以学习排名函数)。在这个过程中,研究人员和云都没有学习到关于原始数据集的任何信息。通过对实际数据的实验,证明了该算法的安全性和有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Prediction Modelling and Pattern Detection Approach for the First-Episode Psychosis Associated to Cannabis Use An Effective and Efficient Similarity-Matrix-Based Algorithm for Clustering Big Mobile Social Data Time Series Classification Using Time Warping Invariant Echo State Networks Improved Time Series Classification with Representation Diversity and SVM Android Malware Detection: Building Useful Representations
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1