面向隐私的多变量时间序列分类特征选择

Procedia Computer Science Pub Date : 2024-01-01 Epub Date: 2024-11-28 DOI:10.1016/j.procs.2024.09.430
Adrian-Silviu Roman , Béla Genge , Roland Bolboacă
{"title":"面向隐私的多变量时间序列分类特征选择","authors":"Adrian-Silviu Roman ,&nbsp;Béla Genge ,&nbsp;Roland Bolboacă","doi":"10.1016/j.procs.2024.09.430","DOIUrl":null,"url":null,"abstract":"<div><div>The widespread use of sensors in various domains such as automotive or healthcare has greatly increased the amount of collected data, leading to important privacy issues. Although the sensor data does not directly identify individuals, it can still reveal sensitive information. Addressing privacy in the context of Time Series Classification (TSC) presents challenges, including the need to balance data utility and privacy protection. The current study introduces a novel privacy-oriented feature selection methodology for TSC, aiming to improve data privacy while preserving utility. We propose a dual-model approach, leveraging two classifiers with opposing objectives: a Utility-Focused Classifier (UFC) and a Privacy-Breaking Classifier (PBC). The methodology introduces the Importance Difference Score (IDS) for feature ranking with the objective of selecting features important for the UFC while removing the features essential for the PBC. The approach includes two feature clustering techniques, one based on IDS and the other on K-means clustering, to optimize the feature selection process. The experiments performed on two driving datasets and two Human Activity Recognition (HAR) datasets, evaluate the effectiveness in reducing the accuracy of potential adversarial classifiers while maintaining appropriate levels of utility. We contribute to the state-of-the-art by offering a configurable framework for feature selection to balance the privacy and utility of the data in TSC.</div></div>","PeriodicalId":20465,"journal":{"name":"Procedia Computer Science","volume":"246 ","pages":"Pages 500-509"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Privacy-Oriented Feature Selection for Multivariate Time Series Classification\",\"authors\":\"Adrian-Silviu Roman ,&nbsp;Béla Genge ,&nbsp;Roland Bolboacă\",\"doi\":\"10.1016/j.procs.2024.09.430\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The widespread use of sensors in various domains such as automotive or healthcare has greatly increased the amount of collected data, leading to important privacy issues. Although the sensor data does not directly identify individuals, it can still reveal sensitive information. Addressing privacy in the context of Time Series Classification (TSC) presents challenges, including the need to balance data utility and privacy protection. The current study introduces a novel privacy-oriented feature selection methodology for TSC, aiming to improve data privacy while preserving utility. We propose a dual-model approach, leveraging two classifiers with opposing objectives: a Utility-Focused Classifier (UFC) and a Privacy-Breaking Classifier (PBC). The methodology introduces the Importance Difference Score (IDS) for feature ranking with the objective of selecting features important for the UFC while removing the features essential for the PBC. The approach includes two feature clustering techniques, one based on IDS and the other on K-means clustering, to optimize the feature selection process. The experiments performed on two driving datasets and two Human Activity Recognition (HAR) datasets, evaluate the effectiveness in reducing the accuracy of potential adversarial classifiers while maintaining appropriate levels of utility. We contribute to the state-of-the-art by offering a configurable framework for feature selection to balance the privacy and utility of the data in TSC.</div></div>\",\"PeriodicalId\":20465,\"journal\":{\"name\":\"Procedia Computer Science\",\"volume\":\"246 \",\"pages\":\"Pages 500-509\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Procedia Computer Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1877050924024700\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/11/28 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Procedia Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1877050924024700","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/28 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

传感器在汽车或医疗保健等各个领域的广泛使用大大增加了收集的数据量,导致了重要的隐私问题。虽然传感器数据不能直接识别个人,但它仍然可以泄露敏感信息。在时间序列分类(TSC)的背景下处理隐私问题带来了挑战,包括需要平衡数据效用和隐私保护。本研究提出了一种新的面向隐私的TSC特征选择方法,旨在提高数据隐私性的同时保持实用性。我们提出了一种双模型方法,利用两个目标相反的分类器:一个以实用为中心的分类器(UFC)和一个隐私破坏分类器(PBC)。该方法引入了重要性差异评分(IDS)来进行特征排名,目的是选择对UFC重要的特征,同时去除对PBC重要的特征。该方法包括两种特征聚类技术,一种基于IDS,另一种基于K-means聚类,以优化特征选择过程。在两个驾驶数据集和两个人类活动识别(HAR)数据集上进行的实验,评估了在保持适当效用水平的同时降低潜在对抗性分类器准确性的有效性。我们通过提供一个可配置的特征选择框架来平衡TSC中数据的私密性和实用性,从而为最先进的技术做出贡献。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Privacy-Oriented Feature Selection for Multivariate Time Series Classification
The widespread use of sensors in various domains such as automotive or healthcare has greatly increased the amount of collected data, leading to important privacy issues. Although the sensor data does not directly identify individuals, it can still reveal sensitive information. Addressing privacy in the context of Time Series Classification (TSC) presents challenges, including the need to balance data utility and privacy protection. The current study introduces a novel privacy-oriented feature selection methodology for TSC, aiming to improve data privacy while preserving utility. We propose a dual-model approach, leveraging two classifiers with opposing objectives: a Utility-Focused Classifier (UFC) and a Privacy-Breaking Classifier (PBC). The methodology introduces the Importance Difference Score (IDS) for feature ranking with the objective of selecting features important for the UFC while removing the features essential for the PBC. The approach includes two feature clustering techniques, one based on IDS and the other on K-means clustering, to optimize the feature selection process. The experiments performed on two driving datasets and two Human Activity Recognition (HAR) datasets, evaluate the effectiveness in reducing the accuracy of potential adversarial classifiers while maintaining appropriate levels of utility. We contribute to the state-of-the-art by offering a configurable framework for feature selection to balance the privacy and utility of the data in TSC.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
4.50
自引率
0.00%
发文量
0
期刊最新文献
Contents Contents Contents Contents Preface
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1