从未清理数据中提取的知识在应用于清理数据时的效用

2008 Sixth Annual Conference on Privacy, Security and Trust Pub Date : 2008-10-01 DOI:10.1109/PST.2008.30

Michal Sramka, R. Safavi-Naini, J. Denzinger, Mina Askari, Jie Gao

{"title":"从未清理数据中提取的知识在应用于清理数据时的效用","authors":"Michal Sramka, R. Safavi-Naini, J. Denzinger, Mina Askari, Jie Gao","doi":"10.1109/PST.2008.30","DOIUrl":null,"url":null,"abstract":"Knowledge discovery systems extract knowledge from data that can be used for making prediction about incomplete data items. Utility is a measure of the usefulness of the discovered knowledge and satisfaction of the user with that knowledge. We motivate and address the question of usefulness of sanitized data using the notion of utility in data mining systems. For this we measure the success of patterns and rules discovered from the original data to make predictions about the sanitized data using a previously developed framework. Using experimental results on a set of medical data we demonstrate that it is possible to make useful predictions about the sanitized medical data when rules discovered from the original unsanitized medical data are used. We explain our results and compare it with the case where no sanitization is involved.","PeriodicalId":422934,"journal":{"name":"2008 Sixth Annual Conference on Privacy, Security and Trust","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Utility of Knowledge Extracted from Unsanitized Data when Applied to Sanitized Data\",\"authors\":\"Michal Sramka, R. Safavi-Naini, J. Denzinger, Mina Askari, Jie Gao\",\"doi\":\"10.1109/PST.2008.30\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Knowledge discovery systems extract knowledge from data that can be used for making prediction about incomplete data items. Utility is a measure of the usefulness of the discovered knowledge and satisfaction of the user with that knowledge. We motivate and address the question of usefulness of sanitized data using the notion of utility in data mining systems. For this we measure the success of patterns and rules discovered from the original data to make predictions about the sanitized data using a previously developed framework. Using experimental results on a set of medical data we demonstrate that it is possible to make useful predictions about the sanitized medical data when rules discovered from the original unsanitized medical data are used. We explain our results and compare it with the case where no sanitization is involved.\",\"PeriodicalId\":422934,\"journal\":{\"name\":\"2008 Sixth Annual Conference on Privacy, Security and Trust\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 Sixth Annual Conference on Privacy, Security and Trust\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PST.2008.30\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 Sixth Annual Conference on Privacy, Security and Trust","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PST.2008.30","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

摘要

知识发现系统从数据中提取知识，用于对不完整的数据项进行预测。效用是对所发现知识的有用性和用户对该知识的满意度的度量。我们使用数据挖掘系统中的效用概念来激发和解决净化数据的有用性问题。为此，我们衡量从原始数据中发现的模式和规则的成功程度，以便使用先前开发的框架对经过处理的数据进行预测。通过对一组医疗数据的实验结果，我们证明，当使用从原始未消毒医疗数据中发现的规则时，可以对消毒后的医疗数据做出有用的预测。我们解释我们的结果，并将其与不涉及消毒的情况进行比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Utility of Knowledge Extracted from Unsanitized Data when Applied to Sanitized Data

Knowledge discovery systems extract knowledge from data that can be used for making prediction about incomplete data items. Utility is a measure of the usefulness of the discovered knowledge and satisfaction of the user with that knowledge. We motivate and address the question of usefulness of sanitized data using the notion of utility in data mining systems. For this we measure the success of patterns and rules discovered from the original data to make predictions about the sanitized data using a previously developed framework. Using experimental results on a set of medical data we demonstrate that it is possible to make useful predictions about the sanitized medical data when rules discovered from the original unsanitized medical data are used. We explain our results and compare it with the case where no sanitization is involved.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2008 Sixth Annual Conference on Privacy, Security and Trust

自引率

0.00%

发文量

期刊最新文献

Incorporating Privacy Outcomes: Teaching an Old Dog New Tricks LogView: Visualizing Event Log Clusters Unlinkable Communication The Uncertainty of the Truth Model-Checking for Software Vulnerabilities Detection with Multi-Language Support