AIセキュリティポータル K Program
Casper: Prompt Sanitization for Protecting User Privacy in Web-Based Large Language Models
Share
Abstract
Web-based Large Language Model (LLM) services have been widely adopted and have become an integral part of our Internet experience. Third-party plugins enhance the functionalities of LLM by enabling access to real-world data and services. However, the privacy consequences associated with these services and their third-party plugins are not well understood. Sensitive prompt data are stored, processed, and shared by cloud-based LLM providers and third-party plugins. In this paper, we propose Casper, a prompt sanitization technique that aims to protect user privacy by detecting and removing sensitive information from user inputs before sending them to LLM services. Casper runs entirely on the user's device as a browser extension and does not require any changes to the online LLM services. At the core of Casper is a three-layered sanitization mechanism consisting of a rule-based filter, a Machine Learning (ML)-based named entity recognizer, and a browser-based local LLM topic identifier. We evaluate Casper on a dataset of 4000 synthesized prompts and show that it can effectively filter out Personal Identifiable Information (PII) and privacy-sensitive topics with high accuracy, at 98.5% and 89.9%, respectively.
Attention is all you need
A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, I. Polosukhin
Published: 2017
On the privacy risk of in-context learning
H. Duan, A. Dziedzic, M. Yaghini, N. Papernot, F. Boenisch
Published: 2023
MegaMind: A Platform for Security & Privacy Extensions for Voice Assistants
S. M. Seyed Talebi, A. Amiri Sani, S. Saroiu, A. Wolman
Published: 2021
Slack Privacy Principles: Search, Learning and Artificial Intelligence
Published: 2024
Role of chat gpt in public health
Som S. Biswas
Published: 2023
The opportunities and challenges of chatgpt in education
I. Adeshola, A. P. Adepoju
Published: 2023
Deep learning with differential privacy
M. Abadi, A. Chu, I. Goodfellow, H. B. McMahan, I. Mironov, K. Talwar, L. Zhang
Published: 2016
Communication-Efficient Learning of Deep Networks from Decentralized Data
H. Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, Blaise Agüera y Arcas
Published: 2016.2.18
Why johnny can’t prompt: how non-ai experts try (and fail) to design llm prompts
J. D. Zamfirescu-Pereira, R. Y. Wong, B. Hartmann, Q. Yang
Published: 2023
How to write effective prompts for large language models
Z. Lin
Published: 2024
Model inversion attacks that exploit confidence information and basic countermeasures
Matt Fredrikson, Somesh Jha, Thomas Ristenpart
Published: 2015
Competitive targeted advertising with price discrimination
R. Esteves, J. Resende
Published: 2016
Search and price discrimination online
E. Mauring
Published: 2021
Privacy expectations and preferences in an {IoT} world
P. E. Naeini, S. Bhagavatula, H. Habib, M. Degeling, L. Bauer, L. F. Cranor, N. Sadeh
Published: 2017
Hey alexa, is this skill safe?: Taking a closer look at the alexa skill ecosystem
C. Lentzsch, Sheel J. Shah, B. Andow, M. Degeling, A. Das, W. Enck
Published: 2021
Privacy in the amazon alexa skills ecosystem
A. Alhadlaq, J. Tang, M. Almaymoni, A. Korolova
Published: 2017
Measuring alexa skill privacy practices across three years
J. Edu, X. Ferrer-Aran, J. Such, G. Suarez-Tangil
Published: 2022
Chinese named entity recognition method based on bert
Y. Chang, L. Kong, K. Jia, Q. Meng
Published: 2021
Neural dynamics of semantic composition
B. Lyu, H. S. Choi, W. D. Marslen-Wilson, A. Clarke, B. Randall, L. K. Tyler
Published: 2019
Innovative instructions and software model for isolated execution.
F. McKeen, I. Alexandrovich, A. Berenzon, C. V. Rozas, H. Shafi, V. Shanbhogue, U. R. Savagaonkar
Published: 2013
SGXIO: Generic Trusted I/O Path for Intel SGX
S. Weiser, M. Werner
Published: 2017
Minimizing a smartphone’s tcb for security-critical programs with exclusively-used, physically-isolated, statically-partitioned hardware
Z. Yao, S. M. Seyed Talebi, M. Chen, A. Amiri Sani, T. Anderson
Published: 2023
Graviton: Trusted execution environments on gpus
Stavros Volos, Kapil Vaswani, Rodrigo Bruno
Published: 2018
ShadowNet: A Secure and Efficient On-device Model Inference System for Convolutional Neural Networks
Zhichuang Sun, Ruimin Sun, Changming Liu, Amrita Roy Chowdhury, Long Lu, Somesh Jha
Published: 2020.11.12
GLeeFuzz: Fuzzing WebGL Through Error Message Guided Mutation
H. Peng, Z. Yao, A. Amiri Sani, D. Tian, M. Payer
Published: 2021
Sugar: Secure GPU Acceleration in Web Browsers
Z. Yao, Z. Ma, Y. Liu, A. Amiri Sani, A. Chandramowlishwaran
Published: 2018
Android: Protecting the Kernel
J. Vander Stoep
Published: 2016
Undo Workarounds for Kernel Bugs
S. M. Seyed Talebi, Z. Yao, A. Amiri Sani, Z. Qian, D. Austin
Published: 2021
Milkomeda: Safeguarding the Mobile GPU Interface Using WebGL Security Checks
Z. Yao, S. Mirzamohammadi, A. Amiri Sani, M. Payer
Published: 2018
Methods and systems for vetted secure access of a remote desktop utilizing contextual application information
S. Liu, Z. Yao
Published: 2023
Fully homomorphic encryption using ideal lattices
C. Gentry
Published: 2009
Cryptonets: Applying neural networks to encrypted data with high throughput and accuracy
Ran Gilad-Bachrach, Nathan Dowlin, Kim Laine, Kristin Lauter, Michael Naehrig, John Wensing
Published: 2016
Towards building the federated gpt: Federated instruction tuning
Zhang, J., Vahidian, S., Kuo, M., Li, C., Zhang, R., Wang, G., Chen, Y.
Published: 2023
Nonlinear pattern matching in rule-based modeling languages
T. Warnke, A. M. Uhrmacher
Published: 2021
Efficient string matching: an aid to bibliographic search
A. V. Aho, M. J. Corasick
Published: 1975
Share