Operationalizing Contextual Integrity in Privacy-Conscious Assistants

Sahra Ghalebikesabi, Eugene Bagdasaryan, Ren Yi, Itay Yona, Ilia Shumailov, Aneesh Pappu, Chongyang Shi, Laura Weidinger, Robert Stanforth, Leonard Berrada, Pushmeet Kohli, Po-Sen Huang, Borja Balle

arXiv:2408.02373 [cs.AI], published 2024-08-05
Abstract
Advanced AI assistants combine frontier LLMs and tool access to autonomously perform complex tasks on behalf of users. While the helpfulness of such assistants can increase dramatically with access to user information, including emails and documents, this access raises privacy concerns about assistants sharing inappropriate information with third parties without user supervision. To steer information-sharing assistants to behave in accordance with privacy expectations, we propose to operationalize contextual integrity (CI), a framework that equates privacy with the appropriate flow of information in a given context. In particular, we design and evaluate a number of strategies to steer assistants' information-sharing actions to be CI-compliant. Our evaluation is based on a novel form-filling benchmark composed of synthetic data and human annotations, and it reveals that prompting frontier LLMs to perform CI-based reasoning yields strong results.
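
To make the idea of "CI-based reasoning" concrete, the sketch below shows one way such a prompt could be assembled for a form-filling decision. It is a minimal, hypothetical illustration, not the paper's actual benchmark or prompts: the InformationFlow fields, prompt wording, and example data are assumptions chosen to mirror the CI parameters (sender, recipient, information type, transmission principle, context) that the framework reasons over. The call to an LLM is deliberately omitted.

from dataclasses import dataclass


@dataclass
class InformationFlow:
    """Contextual-integrity parameters describing a candidate information flow.

    Hypothetical structure for illustration; not taken from the paper.
    """
    sender: str                  # who the data is about / who shares it
    recipient: str               # third party that would receive the data
    attribute: str               # type of information being shared
    transmission_principle: str  # constraint under which sharing occurs
    context: str                 # social context of the exchange


def build_ci_prompt(flow: InformationFlow, form_field: str, value: str) -> str:
    """Compose a prompt asking an LLM to reason about CI compliance
    before filling a form field with user data."""
    return (
        "You are a privacy-conscious assistant filling a form on the user's behalf.\n"
        f"Context: {flow.context}\n"
        f"Sender (data subject): {flow.sender}\n"
        f"Recipient: {flow.recipient}\n"
        f"Information type: {flow.attribute}\n"
        f"Transmission principle: {flow.transmission_principle}\n"
        f"Form field: '{form_field}', candidate value: '{value}'\n\n"
        "Reason step by step about whether this information flow is appropriate "
        "in this context (contextual integrity). Answer 'FILL' only if the flow "
        "is appropriate; otherwise answer 'LEAVE BLANK'."
    )


if __name__ == "__main__":
    # Illustrative example: deciding whether to share a phone number
    # with a dental clinic's appointment form.
    flow = InformationFlow(
        sender="the user",
        recipient="a dental clinic's online appointment form",
        attribute="home phone number",
        transmission_principle="provided for appointment reminders only",
        context="booking a dental appointment",
    )
    prompt = build_ci_prompt(flow, "phone_number", "+1-555-0100")
    print(prompt)  # send this prompt to an LLM of your choice; the call is omitted here

In this sketch the model's FILL / LEAVE BLANK answer would gate the assistant's form-filling action, which is one plausible way to steer information-sharing actions toward CI compliance as the abstract describes.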