How data-sharing nudges influence people's privacy preferences: A machine learning-based analysis

EAI Endorsed Trans. Security Safety Pub Date : 2021-12-21 DOI:10.4108/eai.21-12-2021.172440

Yang Lu, Shujun Li, A. Freitas, A. Ioannou

{"title":"How data-sharing nudges influence people's privacy preferences: A machine learning-based analysis","authors":"Yang Lu, Shujun Li, A. Freitas, A. Ioannou","doi":"10.4108/eai.21-12-2021.172440","DOIUrl":null,"url":null,"abstract":"INTRODUCTION: Many online services use data-sharing nudges to solicit personal data from their customers for personalized services. OBJECTIVES: This study aims to study people’s privacy preferences in sharing di ff erent types of personal data under di ff erent nudging conditions, how digital nudging can change their data sharing willingness, and if people’s data sharing preferences can be predicted using their responses to a questionnaire. METHODS: This paper reports a machine learning-based analysis on people’s privacy preference patterns under four di ff erent data-sharing nudging conditions (without nudging, monetary incentives, non-monetary incentives, and privacy assurance). The analysis is based on data collected from 685 UK residents who participated in a panel survey. Their self-reported willingness levels towards sharing 23 di ff erent types of personal data were analyzed by using both unsupervised (clustering) and supervised (classification) machine learning algorithms. RESULTS: The results led to a better understanding of people’s privacy preference patterns across di ff erent data-sharing nudging conditions, e.g., our participants’ preferences are distributed in a space of 48 possible profiles more sparsely than we expected, and the unexpected observation that all the three data-sharing nudging strategies led to an overall negative e ff ect: they led to a reduced level of self-reported willingness for more participants, comparing with the case of no nudging at all. Our experiments with supervised machine learning models also showed that people’s privacy (data-sharing) preference profiles can be automatically predicted with a good accuracy, even when a small questionnaire with just seven questions is used. CONCLUSION: Our work revealed a more complicated structure of people’s privacy preference profiles, which have some dependencies on the type of data nudging and the type of personal data shared. Such complicated privacy preference profiles can be e ff ectively analyzed using machine learning methods, including automatic prediction based on a small questionnaire. The negative results on the overall e ff ect of di ff erent data-sharing nudges imply that service providers should consider if and how to use such mechanisms to incentivise their consumers to share personal data. We believe that more consumer-centric and transparent methods and tools should be used to help improve trust between consumers and service providers.","PeriodicalId":335727,"journal":{"name":"EAI Endorsed Trans. Security Safety","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"EAI Endorsed Trans. Security Safety","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4108/eai.21-12-2021.172440","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

INTRODUCTION: Many online services use data-sharing nudges to solicit personal data from their customers for personalized services. OBJECTIVES: This study aims to study people’s privacy preferences in sharing di ff erent types of personal data under di ff erent nudging conditions, how digital nudging can change their data sharing willingness, and if people’s data sharing preferences can be predicted using their responses to a questionnaire. METHODS: This paper reports a machine learning-based analysis on people’s privacy preference patterns under four di ff erent data-sharing nudging conditions (without nudging, monetary incentives, non-monetary incentives, and privacy assurance). The analysis is based on data collected from 685 UK residents who participated in a panel survey. Their self-reported willingness levels towards sharing 23 di ff erent types of personal data were analyzed by using both unsupervised (clustering) and supervised (classification) machine learning algorithms. RESULTS: The results led to a better understanding of people’s privacy preference patterns across di ff erent data-sharing nudging conditions, e.g., our participants’ preferences are distributed in a space of 48 possible profiles more sparsely than we expected, and the unexpected observation that all the three data-sharing nudging strategies led to an overall negative e ff ect: they led to a reduced level of self-reported willingness for more participants, comparing with the case of no nudging at all. Our experiments with supervised machine learning models also showed that people’s privacy (data-sharing) preference profiles can be automatically predicted with a good accuracy, even when a small questionnaire with just seven questions is used. CONCLUSION: Our work revealed a more complicated structure of people’s privacy preference profiles, which have some dependencies on the type of data nudging and the type of personal data shared. Such complicated privacy preference profiles can be e ff ectively analyzed using machine learning methods, including automatic prediction based on a small questionnaire. The negative results on the overall e ff ect of di ff erent data-sharing nudges imply that service providers should consider if and how to use such mechanisms to incentivise their consumers to share personal data. We believe that more consumer-centric and transparent methods and tools should be used to help improve trust between consumers and service providers.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

数据共享如何影响人们的隐私偏好:基于机器学习的分析

导读:许多在线服务使用数据共享的方式向客户索取个人数据，以提供个性化服务。目的:本研究旨在研究人们在不同助推条件下分享不同类型个人数据的隐私偏好，数字助推如何改变他们的数据共享意愿，以及是否可以通过他们对问卷的回答来预测人们的数据共享偏好。方法:本文基于机器学习分析了四种不同数据共享助推条件下(无助推、货币激励、非货币激励和隐私保障)人们的隐私偏好模式。该分析基于685名参与小组调查的英国居民收集的数据。通过使用无监督(聚类)和监督(分类)机器学习算法，分析了他们自我报告的分享23种不同类型个人数据的意愿水平。结果:结果使我们更好地理解了不同数据共享助推条件下人们的隐私偏好模式，例如，我们的参与者的偏好分布在48个可能的配置文件的空间中，比我们预期的更稀疏，并且意外地观察到所有三种数据共享助推策略都导致了整体的负影响。与完全没有轻推的情况相比，轻推导致更多参与者的自我报告意愿水平降低。我们对监督机器学习模型的实验也表明，即使使用只有七个问题的小问卷，人们的隐私(数据共享)偏好配置文件也可以以很高的准确性自动预测。结论:我们的研究揭示了人们的隐私偏好结构更为复杂，这与数据推送的类型和共享的个人数据类型有一定的关系。这种复杂的隐私偏好配置文件可以使用机器学习方法进行有效分析，包括基于小问卷的自动预测。不同的数据共享推动对整体效果的负面影响意味着，服务提供商应该考虑是否以及如何使用这些机制来激励其消费者共享个人数据。我们认为，应该使用更多以消费者为中心和透明的方法和工具来帮助提高消费者和服务提供商之间的信任。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

EAI Endorsed Trans. Security Safety

自引率

0.00%

发文量